Remote
Intermediate II DevOps/MLOps - Global Relay
Software Engineer
Intermediate-level DevOps/MLOps engineer responsible for building and maintaining scalable CI/CD pipelines, containerized ML workloads, and cloud infrastructure on AWS, ensuring high availability, security, and performance for enterprise data solutions.
About the role
Key Responsibilities
- Design, implement, and maintain CI/CD pipelines for both application and machine learning model deployments.
- Build and manage Docker images, Kubernetes clusters, and Helm charts to support scalable, resilient services.
- Collaborate with data science and software teams to automate model training, validation, and production rollout.
- Monitor system health, troubleshoot performance bottlenecks, and enforce security best practices across cloud environments.
- Document infrastructure‑as‑code configurations and contribute to internal knowledge bases.
Requirements
- 3+ years of experience in DevOps or MLOps roles, with hands‑on work in CI/CD, Docker, and Kubernetes.
- Proficiency with AWS services (EKS, ECS, S3, CloudWatch, IAM) and infrastructure‑as‑code tools (Terraform, CloudFormation).
- Strong scripting skills in Python and Bash, plus familiarity with ML frameworks (TensorFlow, PyTorch, scikit‑learn).
- Experience with monitoring/alerting tools (Prometheus, Grafana, Datadog) and log aggregation (ELK, CloudWatch Logs).
- Excellent problem‑solving abilities, communication skills, and a collaborative mindset.
Skills
MLOps, CI/CD, Docker, Kubernetes, AWS, Python