onsite
AI Platform Architect - Aalo Atomics
Software Engineer
Lead the design and deployment of scalable AI platforms on AWS, leveraging Python, Kubernetes, and Terraform to deliver robust machine learning solutions.
About the role
Key Responsibilities
- Architect end‑to‑end AI infrastructure on AWS, ensuring high availability, security, and cost efficiency.
- Design and implement containerized ML pipelines using Docker and Kubernetes, integrating CI/CD workflows.
- Develop reusable Terraform modules for rapid provisioning of compute, storage, and networking resources.
- Collaborate with data scientists to translate model prototypes into production‑ready services.
- Monitor platform performance, troubleshoot issues, and optimize resource utilization.
Requirements
- 5+ years of experience building AI/ML platforms in cloud environments.
- Proficiency in Python, AWS services (SageMaker, ECS, EKS, Lambda), and Kubernetes.
- Hands‑on experience with Terraform, Docker, and CI/CD pipelines.
- Strong understanding of ML model deployment, monitoring, and governance.
- Excellent communication skills and the ability to mentor cross‑functional teams.
Skills
Python, AWS, machine learning, Kubernetes, Terraform, Docker