Onsite
Senior AI Infrastructure Engineer - Staffed4U
DevOps Engineer
The Senior AI Infrastructure Engineer leads the design, deployment, and operation of scalable AI/ML platforms, using Python, Kubernetes, Docker, AWS, Terraform, and CI/CD pipelines to deliver enterprise‑grade AI services.
About the role
Key Responsibilities
- Architect and build highly available, scalable AI/ML infrastructure on AWS, using Docker and Kubernetes for containerized workloads and Terraform for infrastructure as code.
- Design and maintain CI/CD pipelines for model training, validation, and deployment, ensuring rapid, reliable delivery of AI services.
- Collaborate with data scientists and ML engineers to optimize model performance, resource utilization, and cost efficiency.
- Implement monitoring, logging, and alerting for AI workloads, ensuring uptime and compliance with security standards.
- Lead incident response and root‑cause analysis for production AI systems, driving continuous improvement.
Requirements
- 5+ years of experience in AI/ML infrastructure engineering or related roles.
- Strong understanding of MLOps principles, model versioning, and data pipeline orchestration.
- Experience with security best practices, including IAM, encryption, and compliance frameworks.
- Excellent problem‑solving skills and ability to work independently in a fast‑paced environment.
Skills
Python, Kubernetes, Docker, AWS, Terraform, CI/CD