Lead ML Inference Engineer, Advertising
As a Lead ML Inference Engineer, Advertising, you will architect, design, and lead the development of a state-of-the-art inference platform that meets advertising-level requirements for low latency, scale, throughput, and availability. This role involves optimizing performance across hardware, software, and models, and requires a strong technical leader with deep experience in ML serving and high-performance computing.
The Advertising Performance group focuses on performance for all participants in the Advertising ecosystem: Advertisers, Publishers, and Roku. Our systems and solutions span multiple disciplines and technologies to perform real-time multi-objective optimization across distributed systems at large scale and with low latency. We use Machine Learning, Reinforcement Learning, AI, Control and Optimization Systems, and Auction Dynamics to solve a large set of complex problems. At the core of this is our Machine Learning and Inference Platform, which powers the entire landscape.
In this role, you will architect, design, and lead the development of a state-of-the-art inference platform that can handle Advertising-level low latencies, scale, throughput, and availability, with optimizations that span hardware, software, and models. We’re looking for a strong technical leader with deep experience in ML serving, high-performance computing, and industry-standard frameworks: someone excited to mentor engineers, innovate at scale, and shape the future of machine learning at Roku.
Posted June 2, 2026