onsite
Senior AI Cloud Infrastructure Engineer - Newrez
Devops Engineer
Lead the design and operation of AI‑driven cloud infrastructure, building scalable, secure, and highly available environments on AWS. Drive automation, CI/CD, and ML‑ops best practices to support data‑centric applications.
About the role
Key Responsibilities
- Architect, deploy, and maintain AI/ML workloads on AWS, ensuring high availability, scalability, and cost efficiency.
- Implement infrastructure as code using Terraform and manage Kubernetes clusters for containerized AI services.
- Design and enforce security, compliance, and monitoring strategies across the cloud stack.
- Collaborate with data scientists and ML engineers to streamline model training, deployment, and monitoring pipelines.
- Automate CI/CD workflows for AI applications, integrating unit tests, model validation, and rollback mechanisms.
Requirements
- 5+ years of experience in cloud infrastructure engineering, with a focus on AI/ML workloads.
- Proficiency in Python, AWS services (SageMaker, ECS/EKS, Lambda, S3, RDS), Kubernetes, and Terraform.
- Strong understanding of security best practices, IAM, VPC, and compliance frameworks.
- Experience with CI/CD tools (GitHub Actions, Jenkins, ArgoCD) and monitoring (Prometheus, Grafana, CloudWatch).
- Excellent problem‑solving skills and a collaborative mindset.
Skills
pythonawskubernetesterraformmachine learning