remote
Associate Site Reliability Engineer - Activision
Site Reliability Engineer
Entry‑level Site Reliability Engineer focused on maintaining reliable, observable Marketing Technology services using Kubernetes, Docker, CI/CD pipelines, and AWS. Strong scripting in Python and hands‑on monitoring tools are essential.
About the role
Key Responsibilities
- Maintain and improve the reliability of Marketing Technology services in production environments.
- Deploy and manage containerized workloads on Kubernetes clusters, ensuring high availability and scalability.
- Implement and refine CI/CD pipelines to automate build, test, and deployment processes.
- Configure and monitor observability tools (e.g., Prometheus, Grafana, ELK) to detect and resolve incidents quickly.
- Collaborate with development and operations teams to troubleshoot performance issues and implement best practices.
Requirements
- Experience with container orchestration using Kubernetes and Docker.
- Proficiency in scripting with Python for automation and tooling.
- Hands‑on experience with CI/CD tools such as Jenkins, GitHub Actions, or GitLab CI.
- Familiarity with cloud platforms, preferably AWS, and related services (EKS, ECS, CloudWatch).
- Strong problem‑solving skills and a proactive approach to incident response and root cause analysis.
Skills
kubernetesdockercicdawspython