remote
Senior MLOps & Generative AI Engineer - Sentara Hospitals
AI Engineer
Lead the design, deployment, and scaling of generative AI models in a fully remote role, leveraging Python, Docker, Kubernetes, AWS, Terraform, and CI/CD pipelines to deliver production‑ready ML solutions.
About the role
Key Responsibilities
- Architect, build, and maintain end‑to‑end MLOps pipelines for training, serving, and monitoring large generative AI models.
- Implement containerized environments using Docker and orchestrate workloads on Kubernetes clusters in AWS.
- Develop infrastructure as code with Terraform to ensure reproducible, secure, and scalable cloud resources.
- Integrate CI/CD tools (e.g., GitHub Actions, Jenkins) to automate model versioning, testing, and deployment.
- Collaborate with data scientists to optimize model performance, manage experiment tracking with MLflow, and ensure compliance with security and governance standards.
Requirements
- 5+ years of hands‑on experience in MLOps, DevOps, or cloud engineering, with a strong focus on AI/ML workloads.
- Proficiency in Python and container technologies (Docker, Kubernetes) for building scalable model services.
- Deep knowledge of AWS services (EKS, S3, SageMaker, IAM) and infrastructure‑as‑code tools such as Terraform.
- Experience designing CI/CD pipelines and using experiment‑tracking platforms like MLflow or similar.
- Solid understanding of generative AI concepts, large language models, and their production challenges.
Skills
pythondockerkubernetesawsterraformcicdmlflow