Senior Machine Learning Engineer, DevOps/SRE
Roku is looking for a Senior Machine Learning Engineer, DevOps/SRE, to join their Advertising Performance team. This role involves designing and operating scalable cloud infrastructure for ML workloads on AWS and GCP, improving CI/CD systems for ML models, and partnering with data scientists to streamline the ML lifecycle. The ideal candidate will have strong DevOps/SRE experience with a passion for building platforms that accelerate ML experimentation and deployment at internet scale.
The Advertising Performance group focuses on performance for all participants in the Advertising ecosystem - Advertisers, Publishers, and Roku. The systems and solutions span multiple disciplines and technologies to perform real-time multi-objective optimization across distributed systems at large scale and with low latency. We use Machine Learning, Reinforcement Learning, AI, Control and Optimization Systems, and Auction Dynamics to solve a large set of complex problems. At the core of this is our Machine Learning, Experimentation, and Inference Platform that powers the entire landscape, which we continuously evolve over time.
We are seeking a talented and experienced Senior Software Engineer, MLOps/DevOps, to join the Advertising Performance team and play a critical role in supporting and scaling our Machine Learning infrastructure. The ideal candidate has a strong background in DevOps/SRE practices, cloud infrastructure management, and MLOps tooling — with a passion for building platforms that accelerate ML experimentation and deployment at internet scale.
You will partner closely with ML Scientists and Engineers to streamline the end-to-end ML lifecycle across training, evaluation, deployment, and monitoring — on top of a modern, cloud-native stack running on GCP and AWS using Kubernetes, Apache Airflow, Spark, Ray, MLflow, Chronon, etc.
Posted June 10, 2026