remote
Senior MLOps & Generative AI Engineer - Sentara
AI Engineer
Lead end‑to‑end MLOps and generative AI initiatives, building scalable pipelines, deploying models with Docker/Kubernetes on AWS, and driving continuous integration and delivery for high‑impact AI solutions.
About the role
Key Responsibilities
- Design, develop, and maintain production‑grade MLOps pipelines for generative AI models, ensuring reliability, scalability, and security.
- Implement containerization (Docker) and orchestration (Kubernetes) strategies to deploy models across cloud environments, primarily AWS.
- Collaborate with data scientists and software engineers to integrate model training, evaluation, and monitoring into CI/CD workflows.
- Automate model versioning, rollback, and A/B testing using tools such as MLflow and Airflow.
- Optimize model performance and resource utilization, reducing inference latency and serving cost.
Requirements
- 5+ years of experience in MLOps, with a strong focus on generative AI or large language models.
- Hands‑on experience with CI/CD pipelines (GitHub Actions, Jenkins, Argo CD) and model monitoring tools.
- Excellent problem‑solving skills and ability to work independently in a fully remote setting.
Skills
mlops, generative ai, python, docker, kubernetes, aws, cicd