onsite
DevOps/SRE Engineer - SWF - BMW TechWorks India
Site Reliability Engineer
A DevOps/SRE Engineer role focused on building and operating cloud‑native platforms with Kubernetes, CI/CD pipelines, and observability stacks, using Docker, Python, and AWS to deliver secure, high‑velocity releases.
About the role
Key Responsibilities
- Design, deploy, and maintain Kubernetes clusters and associated infrastructure for production services.
- Build and manage end‑to‑end CI/CD pipelines using GitOps principles to enable rapid, reliable deployments.
- Implement and monitor observability solutions (metrics, logs, traces) to ensure system health and performance.
- Enforce security best practices, including image scanning, secrets management, and compliance checks across the stack.
- Troubleshoot and resolve production incidents, collaborating with cross‑functional teams to identify root causes and prevent recurrence.
Requirements
- Proven experience with Docker, Kubernetes, and cloud‑native tooling.
- Strong scripting skills in Python and familiarity with CI/CD tooling (GitHub Actions, GitLab CI, Argo CD).
- Hands‑on experience with AWS services (EKS, ECS, S3, IAM, CloudWatch).
- Solid understanding of observability concepts and tools (Prometheus, Grafana, Loki, Jaeger).
- Excellent problem‑solving skills and the ability to work independently in a fast‑paced environment.
Skills
Docker, Kubernetes, CI/CD, Python, AWS