remote
AI Infrastructure Engineer - BV Teck
Devops Engineer
Lead the design and maintenance of scalable AI infrastructure, leveraging Python, AWS, Kubernetes, Docker, and Terraform to deliver robust ML pipelines and production-ready services.
About the role
Key Responsibilities
- Architect, deploy, and manage AI/ML workloads on AWS using services such as SageMaker, ECS, and EKS.
- Build and maintain containerized pipelines with Docker and Kubernetes, ensuring high availability and scalability.
- Implement IaC with Terraform to provision and version infrastructure across multiple environments.
- Integrate CI/CD workflows for model training, testing, and deployment using GitHub Actions or Jenkins.
- Collaborate with data scientists to optimize model performance and streamline data ingestion pipelines.
- Monitor system health, troubleshoot performance bottlenecks, and enforce security best practices.
Requirements
- 3+ years of experience in AI/ML infrastructure or DevOps roles.
Skills
pythonawskubernetesdockerterraformcicd