onsite
Software Engineer II Research Engineer - AI/ML Systems - Career Moves Group
Research Engineer
Research Engineer building and evaluating LLM‑driven agents for fundamental AI research, implementing prototypes in Python and PyTorch, and scaling experiments on distributed compute resources.
About the role
Key Responsibilities
- Design, develop, and iterate on autonomous agents that leverage large language models for research tasks.
- Implement end‑to‑end ML pipelines in Python, using PyTorch for model training, fine‑tuning, and inference.
- Collaborate with scientists to translate research concepts into production‑ready code and experimental frameworks.
- Optimize computational workloads across multi‑GPU/CPU clusters, ensuring reproducibility and scalability.
- Document code, experiment results, and best practices to support knowledge sharing within the team.
Requirements
- Strong proficiency in Python and experience with deep‑learning libraries such as PyTorch.
- Hands‑on experience building or fine‑tuning large language models (e.g., GPT, BERT, LLaMA).
- Solid understanding of machine‑learning fundamentals, including model evaluation, data pipelines, and distributed training.
- Ability to work independently in a fast‑paced research environment while collaborating effectively with cross‑functional teams.
- Experience with cloud or on‑premise high‑performance compute resources (GPU clusters, container orchestration, etc.).
Skills
pythonpytorchmachine learning