onsite

Software Engineer II - AI/ML, AWS Neuron - Amazon

Software Engineer

Software engineer focused on building and optimizing the AWS Neuron SDK to accelerate deep learning and GenAI workloads on custom ML accelerators, using Python, C++, and AWS technologies.

About the role

Key Responsibilities

Design, develop, and maintain high‑performance components of the AWS Neuron SDK, including the ML compiler, runtime, and application framework.
Collaborate with cross‑functional teams to integrate new features and performance improvements for Inferentia and Trainium accelerators.
Optimize code for low‑latency inference and high‑throughput training, ensuring scalability across large model deployments.
Debug, profile, and troubleshoot complex issues in distributed environments, providing actionable insights to improve reliability.
Contribute to open‑source tooling and internal libraries, documenting best practices and performance guidelines.

Requirements

Strong experience in C++ and Python for performance‑critical systems.
Deep knowledge of machine learning frameworks (e.g., TensorFlow, PyTorch) and model optimization techniques.
Hands‑on experience with AWS services and custom ML accelerators such as Inferentia and Trainium.
Proficiency in profiling, debugging, and performance tuning of large‑scale ML workloads.
Excellent communication skills and a collaborative mindset for working in a fast‑paced, distributed team.

Skills

Python, C, Machine Learning, AWS

Company: Amazon

Department: Engineering

Location: Seattle, United States

Experience: 3+ years

Tenure: Full-time

Level: Mid-Level

Posted June 23, 2026