onsite
Senior Machine Learning Operations Engineer - BetMGM LLC
ML Engineer
Lead the design and deployment of scalable machine learning pipelines using Python, Docker, Kubernetes, and AWS, while implementing CI/CD and infrastructure-as-code with Terraform to ensure reliable, production‑grade ML operations.
About the role
Key Responsibilities
- Architect, build, and maintain end‑to‑end ML pipelines that support real‑time inference and batch scoring at scale.
- Containerize models with Docker, orchestrate deployments on Kubernetes, and manage cluster lifecycle in AWS.
- Implement CI/CD workflows for model training, testing, and deployment using GitHub Actions, Jenkins, or similar tools.
- Write and maintain Terraform scripts to provision and manage cloud resources, ensuring reproducibility and compliance.
- Collaborate with data scientists, software engineers, and product teams to translate model requirements into production‑ready solutions.
- Monitor model performance, detect drift, and automate retraining pipelines.
Requirements
- 5+ years of experience in ML operations or related roles.
- Experience with model monitoring, logging, and alerting tools (Prometheus, Grafana, CloudWatch).
- Excellent problem‑solving skills and ability to work in a fast‑paced, collaborative environment.
Skills
machine learningpythondockerkubernetesawscicdterraform