onsite
Machine Learning Engineer - Generative AI - Qualcomm
ML Engineer
Experienced Machine Learning Engineer focused on Generative AI, building and deploying large language model solutions, RAG pipelines, and intelligent agents using Python and deep learning frameworks.
About the role
Key Responsibilities
- Design and implement Retrieval‑Augmented Generation pipelines to integrate external knowledge bases with large language models.
- Develop, fine‑tune, and optimize LLMs for domain‑specific tasks, ensuring high performance and low latency.
- Build intelligent agent architectures that combine LLM reasoning with real‑time data retrieval and decision making.
- Collaborate with cross‑functional teams to integrate AI solutions into modem software stacks and cloud services.
- Conduct experiments, benchmark models, and produce reproducible research documentation.
Requirements
- 5+ years of professional experience in machine learning, with a focus on generative AI and large language models.
- Strong proficiency in Python and deep learning frameworks such as PyTorch or TensorFlow.
- Hands‑on experience designing RAG systems, fine‑tuning LLMs, and deploying models at scale.
- Solid understanding of cloud platforms (e.g., AWS, Azure) and containerization for model serving.
- Excellent problem‑solving skills and ability to work in a fast‑paced, collaborative environment.
Skills
pythonpytorchtensorflowmachine learning