Product Manager, Public Sector GenAI Test & Evaluation (T&E)
The Product Manager, Public Sector GenAI T&E, will lead the vision and roadmap for Scale's evaluation capabilities for government agencies, focusing on the T&E tech stack for continuously measuring and improving AI application performance. This role involves identifying bottlenecks across engineering organizations, distilling technical friction into actionable plans, and driving execution to ensure robust evaluation services for demanding public sector use cases.
At Scale, our mission is to develop reliable AI systems for the world’s most important decisions. The Public Sector team is at the forefront of this mission, partnering with government agencies to deploy mission-critical agentic solutions.
The Public Sector GenAI T&E Product Manager will be a high-horsepower technical leader, defining the vision and owning the roadmap for our evaluation capabilities. This role requires thriving in unscripted, high-stakes environments, as you will be the primary owner for the T&E tech stack—the robust infrastructure required to continuously measure, improve, and prove the superiority and sustained performance of our agentic applications.
Traversing multiple engineering organizations across Scale, you will identify bottlenecks, distill technical friction into actionable plans, and drive execution. You will work across Scale’s commercial and public sector teams to define requirements, ensuring our evaluation services are robust enough for the most demanding government use cases. Key objectives include refining the tech stack that allows ML teams to hillclimb, and surfacing critical performance information to stakeholders.
Posted May 29, 2026