Inference Technical , Sora
OpenAI is seeking an Inference Technical Lead for the Sora team, focusing on GPU Inference to enhance model serving efficiency. This high-impact role involves optimizing inference performance and scalability, collaborating with research teams, and designing critical serving infrastructure.
The Sora team is pioneering multimodal capabilities for OpenAI’s foundation models. We’re a hybrid research and product team focused on integrating multimodal functionalities into our AI products, ensuring they are reliable, user-friendly, and aligned with our mission of broad societal benefit.
We’re looking for a GPU Inference Engineer to contribute to improvements in model serving efficiency for Sora. This is a high-impact role where you’ll drive initiatives to optimize inference performance and scalability. You’ll also be engaged in model design, to help assist our researchers in developing inference-friendly models.
This role is critical to scaling the team’s broader goals - it will directly enable leadership to focus on higher-leverage initiatives by building a stronger technical foundation.
Posted June 6, 2026