remote
Staff/Principal AI Scientist - Mixture of Architecture Foundation Models - SAP
Software Engineer
Lead cutting‑edge research on mixture‑of‑architecture foundation models, driving scalable deep learning solutions with PyTorch/TensorFlow on cloud platforms.
About the role
Key Responsibilities
- Design, prototype, and deploy novel mixture‑of‑architecture foundation models that integrate multiple neural network paradigms.
- Lead end‑to‑end research cycles, from hypothesis to production‑ready models, ensuring reproducibility and performance at scale.
- Collaborate with cross‑functional teams to translate research insights into industry‑ready AI services.
- Mentor junior researchers and engineers, fostering a culture of experimentation and continuous learning.
- Publish findings in top conferences and maintain an active presence in the AI research community.
Requirements
- Ph.D. or equivalent experience in Machine Learning, Computer Science, or related field.
- Deep expertise in deep learning frameworks (PyTorch, TensorFlow) and large‑scale model training.
- Proven track record of publishing high‑impact research on foundation models or related areas.
- Strong programming skills in Python and experience with distributed training on cloud platforms (AWS, GCP).
- Excellent communication skills and ability to translate complex concepts to non‑technical stakeholders.
Skills
machine learningdeep learningpytorchtensorflowreinforcement learning