remote
Site Reliability Engineer - Akamai
This Site Reliability Engineer role focuses on configuration management, infrastructure as code (IaC), and CI/CD pipelines. You will build scalable, reliable infrastructure for a global cloud platform using Python, Terraform, Ansible, and Kubernetes across the major cloud providers.
About the role
Key Responsibilities
- Design, develop, and maintain infrastructure-as-code using Terraform and Ansible to provision and manage cloud resources across AWS, GCP, and Azure.
- Build and optimize CI/CD pipelines with GitHub Actions, Jenkins, or similar tools to automate application deployments and infrastructure changes.
- Collaborate with development, security, and operations teams to ensure high availability, performance, and security of services at scale.
- Implement monitoring, alerting, and logging solutions (Prometheus, Grafana, ELK) to detect and remediate incidents proactively.
- Participate in on‑call rotations, incident response, and post‑mortem analysis to continuously improve reliability.
Requirements
- 3+ years of experience in site reliability engineering or DevOps roles.
Skills
Python, Terraform, Ansible, CI/CD, Kubernetes