remote
Staff Platform Engineer - abridge
Devops Engineer
Lead the design and operation of a scalable, AI‑driven healthcare platform, driving cloud infrastructure, automation, and reliability using Python, AWS, Kubernetes, Docker, Terraform, and CI/CD pipelines.
About the role
Key Responsibilities
- Architect and maintain a highly available, secure cloud infrastructure on AWS that supports real‑time AI inference and data pipelines.
- Design, implement, and manage Kubernetes clusters, ensuring efficient container orchestration, autoscaling, and rolling deployments.
- Develop and maintain Terraform modules for reproducible infrastructure provisioning across multiple environments.
- Build and optimize CI/CD pipelines (GitHub Actions, Jenkins, or similar) to automate testing, linting, and deployment of services.
- Collaborate with data science, backend, and product teams to integrate new AI models and features into the platform.
- Implement observability, logging, and monitoring solutions (Prometheus, Grafana, CloudWatch) to detect and remediate incidents proactively.
Requirements
- 10+ years of experience in cloud platform engineering, with a strong focus on AWS services.
- Proficiency in Python and container technologies (Docker, Kubernetes).
- Hands‑on experience with IaC tools, especially Terraform, and CI/CD pipeline design.
- Deep understanding of distributed systems, networking, and security best practices.
- Excellent communication skills and a proven ability to mentor junior engineers.
Skills
pythonawskubernetesdockerterraformcicd