Remote
MLOps Engineer LLM/GenAI - HSBC
Lead the design and deployment of production‑grade MLOps pipelines for large language models and generative AI solutions, leveraging Docker, Kubernetes, AWS, and CI/CD to ensure scalable, secure, and reliable model serving.
About the role
Key Responsibilities
- Design, build, and maintain end‑to‑end MLOps pipelines for large language models and generative AI applications.
- Containerize models using Docker and orchestrate deployments on Kubernetes clusters in AWS.
- Implement CI/CD workflows for automated testing, model versioning, and continuous delivery.
- Ensure model governance, monitoring, and compliance with security and regulatory standards.
- Collaborate with data scientists, software engineers, and product teams to translate research prototypes into production services.
Requirements
- Strong experience with Python, Docker, Kubernetes, and AWS services (EKS, S3, SageMaker).
- Hands‑on knowledge of MLOps tools such as MLflow, Kubeflow, or similar.
- Proficiency with CI/CD tooling (GitHub Actions, Jenkins, Argo CD).
- Experience deploying and scaling large language models and generative AI workloads.
- Excellent problem‑solving skills and a collaborative mindset.
Skills
Python, Docker, Kubernetes, AWS, CI/CD, machine learning, generative AI