remote
AI Research Engineer - Pre training
AI Research Engineer
Senior AI Research Engineer to develop and optimize pre-training methodologies for large-scale language models using advanced deep learning techniques.
About the role
Key Responsibilities
- Design and implement pre-training strategies for large-scale AI models
- Optimize training pipelines for efficiency and scalability
- Collaborate with cross-functional teams to integrate models into production systems
- Research novel techniques in neural network architectures and training methodologies
- Evaluate model performance and iterate on improvements
- Document research findings and contribute to technical reports
Requirements
- Master's or PhD in Computer Science, AI, or related field
- 3+ years of experience in machine learning research or model training
- Strong proficiency in Python and deep learning frameworks (PyTorch/TensorFlow)
- Experience with large-scale distributed training systems
- Publications or contributions to open-source AI projects are a plus
Skills
machine learningdeep learningpythonpytorchnatural language processingmodel training