onsite
Principal Product Manager - AI/ML Training - Amazon
Product Manager
Lead product strategy for AWS Trainium’s training stack, driving distributed deep‑learning libraries and post‑training workflows to enable high‑performance, cost‑efficient generative AI workloads.
About the role
Key Responsibilities
- Define and execute the product roadmap for Trainium’s training software, including distributed training libraries and post‑training pipelines.
- Collaborate with engineering, data science, and customer success teams to gather requirements and translate them into scalable, high‑performance solutions.
- Drive innovation in RLHF, DPO, and other advanced post‑training techniques to support frontier generative AI models.
- Analyze market trends, competitor offerings, and customer feedback to inform feature prioritization and positioning.
- Own product metrics, iterate on performance and usability, and ensure alignment with AWS Neuron’s overall strategy.
Requirements
- 10+ years of product management experience in AI/ML or high‑performance computing environments.
- Deep technical knowledge of distributed deep‑learning frameworks, GPU/TPU architectures, and cloud‑scale training.
- Proven track record delivering complex, customer‑centric products on AWS or similar cloud platforms.
- Strong analytical, communication, and stakeholder‑management skills.
- Experience with RLHF, DPO, or related generative AI post‑training workflows is highly desirable.
Skills
awsdeep learninggenerative ai