Remote
Staff AI/ML Platform Engineer - Albert Invent Corp
DevOps Engineer
Lead the design and scaling of AI/ML infrastructure, building robust pipelines and production‑ready models on AWS and Kubernetes, while driving MLOps best practices and continuous delivery.
About the role
Key Responsibilities
- Architect and maintain end‑to‑end ML pipelines from data ingestion to model deployment using Python, TensorFlow, and PyTorch.
- Design scalable, containerized services on Kubernetes and manage infrastructure on AWS (EKS, S3, SageMaker).
- Implement CI/CD workflows for model training, testing, and rollout, ensuring reproducibility and rapid iteration.
- Collaborate with data scientists and product teams to translate research prototypes into production‑grade solutions.
- Monitor model performance, set up automated drift detection, and orchestrate retraining cycles.
Requirements
- 10+ years of software engineering experience with a focus on AI/ML systems.
- Deep expertise in Python, TensorFlow/PyTorch, and Kubernetes orchestration.
- Proven track record building MLOps pipelines on AWS and implementing CI/CD for ML workflows.
- Strong problem‑solving skills and ability to mentor junior engineers.
- Excellent communication and collaboration across cross‑functional teams.
Skills
Python, TensorFlow, PyTorch, Kubernetes, AWS, CI/CD, MLOps