onsite
Cloud Platform Engineer - AI/ML Platform - State Street
Devops Engineer
Senior Cloud Platform Engineer focused on AI/ML workloads, designing and maintaining secure, scalable cloud infrastructure, CI/CD pipelines, and MLOps workflows using Kubernetes and cloud-native automation.
About the role
Key Responsibilities
- Design, implement, and manage cloud infrastructure that supports AI and machine learning applications, ensuring high availability and security.
- Build and maintain end‑to‑end CI/CD pipelines for both application code and machine learning model deployments.
- Automate infrastructure provisioning and configuration using IaC tools, integrating with Kubernetes clusters and cloud services.
- Implement MLOps best practices, including model versioning, monitoring, and rollback strategies.
- Collaborate with data scientists, software engineers, and security teams to optimize performance and compliance.
Requirements
- 8–12+ years of experience in DevOps or cloud platform engineering, with a strong focus on AI/ML workloads.
- Proficiency in cloud-native technologies (Kubernetes, Helm, Istio) and major cloud providers (AWS, GCP, or Azure).
- Hands‑on experience with CI/CD tools (GitLab CI, Jenkins, ArgoCD) and infrastructure automation (Terraform, Pulumi, Ansible).
- Deep understanding of MLOps concepts, model registry, and monitoring solutions.
- Excellent problem‑solving skills and ability to work in a fast‑paced, collaborative environment.
Skills
cicdkubernetesmlops