onsite
Senior ML Platform Engineer
ML Platform Engineer
As a Senior ML Platform Engineer, you will be responsible for building and optimizing the machine learning infrastructure for large language models. This includes maintaining training pipelines, enhancing LLM inference performance, and deploying models at scale while collaborating with other engineering teams.
About the role
About the Role
Baseten is seeking a Senior ML Platform Engineer to join our team. In this role, you will be instrumental in building and optimizing our machine learning infrastructure to support the training, serving, and management of large language models (LLMs).
Responsibilities
- Build and maintain training infrastructure, feature stores, and model serving pipelines.
- Optimize LLM inference performance, focusing on compute efficiency, memory management, latency, and throughput.
- Read, debug, and contribute to LLM runtime and supporting library code, primarily in Rust and/or C++.
- Deploy and manage models at scale using tools like vLLM and Baseten.
- Architect scalable pipelines for model training and serving across GPU infrastructure.
- Collaborate with ML and data engineers to ensure the platform meets research and production needs.
Skills
RustC++VllmBasetenLlm InferenceGPU infrastructureMl Platformsfeature storesmodel serving pipelinescompute efficiencymemory managementlatencythroughput