remote
Senior Site Reliability Engineer - Akamai
Site Reliability Engineer
Senior Site Reliability Engineer focused on designing, building, and operating scalable infrastructure using Python, Node.js, Terraform, Kubernetes, and CI/CD pipelines to ensure reliability, security, and performance for a global cloud platform.
About the role
Key Responsibilities
- Design, develop, and maintain infrastructure-as-code (IaC) using Terraform to provision and manage cloud resources at scale.
- Build and optimize CI/CD pipelines for automated testing, deployment, and rollback of services across Kubernetes clusters.
- Collaborate with cross‑functional teams to troubleshoot, monitor, and improve system reliability, security, and performance.
- Implement automation scripts in Python and Node.js to streamline operations, configuration management, and incident response.
- Participate in capacity planning, load testing, and performance tuning to support a global fleet of services.
Requirements
- 5+ years of experience in site reliability engineering or DevOps roles.
- Proficiency with Terraform and with container orchestration on Kubernetes.
- Strong scripting skills in Python and Node.js.
- Hands‑on experience with CI/CD tools (Jenkins, GitHub Actions, GitLab CI).
- Excellent problem‑solving skills and a passion for automation and scalability.
Skills
Python, Node.js, Terraform, Kubernetes, CI/CD