onsite
Sr. Machine Learning Framework, Complier & Performance Engineer - Qualcomm
ML Engineer
Senior engineer driving ML framework, compiler, and performance optimization using C++, Python, and LLVM to accelerate next‑generation AI workloads.
About the role
Key Responsibilities
- Design, develop, and maintain high‑performance ML frameworks and compiler passes for Qualcomm’s AI stack.
- Implement performance‑critical kernels and optimizations in C++ and CUDA, targeting embedded and mobile platforms.
- Collaborate with cross‑functional teams to integrate new ML models and algorithms into production pipelines.
- Profile, benchmark, and tune ML workloads to meet stringent latency and throughput targets.
- Contribute to open‑source LLVM projects and internal tooling to support scalable ML development.
Requirements
- 5+ years of experience in ML systems, compiler construction, or performance engineering.
- Proficiency in C++ (modern standards), Python, and LLVM infrastructure.
- Strong background in GPU/CPU optimization, CUDA, and low‑level performance analysis.
- Experience with machine learning frameworks (TensorFlow, PyTorch) and model deployment pipelines.
- Excellent problem‑solving skills and ability to work in a fast‑paced, collaborative environment.
Skills
machine learningcpython