onsite

Senior Software Engineer, AI/ML, LLM Modeling

Software Engineer

Senior Software Engineer role specializing in AI/ML: you will build and optimize large language model pipelines for data processing, distillation, and information retrieval using Python and deep learning frameworks.

About the role

Key Responsibilities

Design, develop, and maintain end‑to‑end pipelines for training, fine‑tuning, and serving large language models (LLMs) on diverse text corpora.
Implement data processing and distillation techniques to improve model efficiency and retrieval performance.
Collaborate with research and product teams to translate state‑of‑the‑art NLP research into production‑ready solutions.
Optimize model inference latency and scalability on cloud infrastructure, and maintain robust monitoring and logging.
Write clean, testable code and contribute to code reviews, documentation, and best‑practice standards.

Requirements

5+ years of professional software engineering experience with Python and deep learning libraries such as PyTorch or TensorFlow.
Hands‑on experience building, fine‑tuning, or deploying large language models (e.g., GPT, BERT, T5) for NLP tasks.
Strong understanding of data processing, model distillation, and information retrieval techniques.
Proficiency in designing scalable, high‑performance systems for model training and inference.
Excellent problem‑solving skills and ability to work independently in a fast‑moving, research‑driven environment.

Skills

Python, PyTorch, TensorFlow, natural language processing, machine learning

Department: Research

Location: Zürich, Switzerland

Experience: 5+ years

Tenure: full-time

Level: Senior

Posted June 19, 2026