remote
AI Platform Engineer - Hellmann
Devops Engineer
Lead the design, deployment, and scaling of AI solutions on cloud platforms, leveraging Python, TensorFlow, Kubernetes, and AWS to deliver robust, production‑grade machine learning services.
About the role
Key Responsibilities
- Architect and maintain end‑to‑end AI pipelines from data ingestion to model serving on AWS.
- Containerize models with Docker and orchestrate deployments using Kubernetes.
- Implement CI/CD workflows for automated testing, model validation, and continuous delivery.
- Collaborate with data scientists to translate research prototypes into scalable production services.
- Monitor model performance, troubleshoot issues, and iterate on infrastructure for optimal latency and throughput.
Requirements
- Proven experience building AI/ML platforms in production environments.
- Strong proficiency in Python, TensorFlow or PyTorch, and container orchestration.
- Hands‑on expertise with AWS services (EKS, SageMaker, S3, Lambda).
- Solid understanding of CI/CD pipelines and automated testing for ML workflows.
- Excellent problem‑solving skills and a collaborative mindset.
Skills
pythontensorflowkubernetesdockerawscicd