Research Engineer/Scientist (all levels), Efficient Models
The Vision-Applied Research team at ByteDance is seeking a Research Engineer/Scientist to design and implement efficient models for large-scale generative AI, focusing on large model distillation and compression. This role involves developing methods and infrastructure for transferring capabilities from foundation models to smaller, more efficient models, enabling scalable training, optimization, and deployment. Responsibilities include developing efficient algorithms and architectures for generative and multimodal models using techniques like step distillation, cfg distillation, and quantization.
The Vision-Applied Research team focuses on applied research in Generative AI and CV/Multimodal Understanding, and delivering intelligent solutions to ByteDance products, enabling users to make and share creative content in a much easier way. The team has research groups dedicated to generative models for content creation, image generation, video synthesis, intelligent image/video editing, and virtual humans. The team is looking for a Research Engineer / Scientist who can take initiatives in designing and implementing efficient models for large-scale generative AI, with a particular emphasis on large model distillation and compression. The candidate will work on developing methods and infrastructure for transferring capabilities from foundation models into smaller, more efficient models, enabling scalable training, optimization, and deployment. Responsibilities may include, but are not limited to, distillation frameworks, model acceleration, hardware-efficient inference, and their applications.
Posted June 7, 2026