Member of Technical Staff - Inference
Runway is seeking an ML infrastructure engineer to serve as a Member of Technical Staff, Inference. This role involves bridging the gap between research and production by productionizing cutting-edge generative models and building infrastructure for fast iteration. The successful candidate will work directly with research teams to ensure reliability at scale and support the entire model development lifecycle.
We're looking for an ML infrastructure engineer to bridge the gap between research and production at Runway. You'll work directly with our research teams to productionize cutting-edge generative models—taking checkpoints from training to staging to production, ensuring reliability at scale, and building the infrastructure that enables fast iteration.
You'll be embedded within research teams, providing platform support throughout the entire model development lifecycle. Your work will directly impact how quickly we can ship new models and features to millions of users.
Our API endpoints for real-time collaboration and media asset management is written in TypeScript, and runs in ECS containers on AWS Fargate. We leverage multiple AWS-native components, such as S3, CloudFront, Lambda, Kinesis, and SQS, as building blocks of our infrastructure.
Our inference backend is written in Python (PyTorch, TorchScript), and is deployed across multiple clusters / cloud providers. We use Kubernetes for container orchestration, and k8s-native components such as Flyte, Kueue, and Kyverno efficient job orchestration. We invest in prometheus and grafana for monitoring, and Terraform to manage our infrastructure.
Posted June 8, 2026