onsite
Senior Software Engineer, AI/ML, LLM Modeling
Software Engineer
Senior Software Engineer specializing in AI/ML, building and optimizing large language model pipelines for data processing, distillation, and information retrieval using Python and deep learning frameworks.
About the role
Key Responsibilities
- Design, develop, and maintain end‑to‑end pipelines for training, fine‑tuning, and serving large language models (LLMs) on diverse text corpora.
- Implement data processing and distillation techniques to improve model efficiency and retrieval performance.
- Collaborate with research and product teams to translate state‑of‑the‑art NLP research into production‑ready solutions.
- Optimize model inference latency and scalability on cloud infrastructure, ensuring robust monitoring and logging.
- Write clean, testable code and contribute to code reviews, documentation, and best‑practice standards.
Requirements
- 5+ years of professional software engineering experience with Python and deep learning libraries such as PyTorch or TensorFlow.
- Hands‑on experience building, fine‑tuning, or deploying large language models (e.g., GPT, BERT, T5) for NLP tasks.
- Strong understanding of data processing, model distillation, and information retrieval techniques.
- Proficiency in designing scalable, high‑performance systems for model training and inference.
- Excellent problem‑solving skills and ability to work independently in a fast‑moving, research‑driven environment.
Skills
pythonpytorchtensorflownatural language processingmachine learning