Manager I, Engineering - AI Platform - Training & Serving
Manage the AI platform infrastructure that enables data scientists and engineers to conduct large-scale training and inference with ease, in support of products such as Bits AI and LLMObs and of Datadog's AI research.
The AI platform is responsible for all AI infrastructure across Datadog. Our mission is to provide tools and platforms that enable data scientists and engineers to conduct large-scale training and inference with ease. We support products such as Bits AI, LLMObs, and all our AI research.
As an engineering manager for the Training & Serving team, you’ll join a new and fast-growing team and organization. You will help build and scale the team, define our technical vision, and shape the roadmap. Your team will lead the charge on multiple critical technical challenges: distributed training of foundation models, serving at scale, and user experience design.
You’ll work closely with sister teams in the AI platform organization to ensure a seamless AI development cycle. You’ll also partner with the Applied AI org and with Datadog infrastructure & tooling teams to build out systems from the ground up.
At Datadog, we place value in our office culture - the relationships that it builds, the creativity it brings to the table, and the collaboration of being together. We operate as a hybrid workplace to ensure our employees can create a work-life harmony that best fits them.
What You’ll Do:
Who You Are:
Datadog values people from all walks of life. We understand not everyone will meet all the above qualifications on day one. That's okay.
Posted June 7, 2026