remote
Infrastructure Engineer Site Reliability Engineer - Medical Mutual of Ohio
Site Reliability Engineer
Infrastructure Engineer (SRE) responsible for designing, deploying, and maintaining scalable, highly available cloud infrastructure using AWS, Kubernetes, Docker, and Terraform, while ensuring reliability, performance, and security through CI/CD pipelines and proactive monitoring.
About the role
Key Responsibilities
- Design, implement, and manage AWS-based infrastructure, ensuring high availability and scalability for mission‑critical applications.
- Build and maintain Kubernetes clusters, Docker containers, and CI/CD pipelines to streamline deployment and release cycles.
- Use Terraform and other IaC tools to provision and version control infrastructure resources.
- Implement monitoring, alerting, and logging solutions (Prometheus, Grafana, ELK) to detect and resolve incidents quickly.
- Collaborate with development teams to optimize application performance and enforce security best practices.
Requirements
- 3+ years of experience in cloud infrastructure, preferably AWS, with hands‑on Kubernetes and Docker.
- Proficiency in Terraform, Ansible, or similar IaC tools.
- Strong scripting skills in Bash or Python for automation.
- Solid understanding of networking, load balancing, and security concepts.
- Excellent problem‑solving skills and ability to work in a fast‑paced, hybrid environment.
Skills
awskubernetesdockerterraformcicdlinux