Remote
Senior Software Engineer - Model Training & AI Evals - Chegg
Software Engineer
Senior Software Engineer focused on model training and AI evaluations, building end‑to‑end evaluation pipelines, synthetic data generation, and benchmarking for foundation models using Python, PyTorch, and TensorFlow.
About the role
Key Responsibilities
- Design and maintain the end‑to‑end evaluation and benchmarking infrastructure for large language models.
- Develop and optimize post‑training pipelines for industry‑specific foundation models.
- Create synthetic datasets to target identified capability gaps and failure modes.
- Define and refine skill taxonomies, evaluation metrics, and success criteria for model assessment.
- Collaborate with research and product teams to integrate evaluation results into the model improvement cycle.
Requirements
- 5+ years of software engineering experience in AI/ML, with a focus on large language models.
- Proficiency in Python and deep learning frameworks such as PyTorch or TensorFlow.
- Strong background in evaluation methodology, metric design, and benchmarking at scale.
- Experience generating synthetic data and working with large datasets.
- Excellent problem‑solving skills and the ability to communicate complex concepts to cross‑functional teams.
Skills
Python, PyTorch, TensorFlow