onsite

Large Scale Video Understanding Research Scientist

As a Large Scale Video Understanding Research Scientist, you will enhance video generation quality and efficiency by improving video and audio understanding pipelines for training data construction and model evaluation. This role involves hands-on work with large-scale Video Language Models (VLLMs), including fine-tuning and control, as well as implementing computer vision and signal processing algorithms.

About the role

About Lightricks

Lightricks is an AI-first company that develops next-generation content-creation technology for businesses, enterprises, and studios, aiming to bridge the gap between imagination and creation. The company's core technology is LTX-2, an open-source generative video model designed for expressive, high-fidelity video at unmatched speed, powering both its own products and a growing ecosystem of partners via API integration. Lightricks is also known globally for pioneering consumer creativity with products like Facetune, an AI-powered visual expression tool used by hundreds of millions of users worldwide. The company combines deep research, user-first design, and end-to-end execution to bring the future of expression to all.

Team & Role

The Core Generative AI team at Lightricks Research is a unified group of researchers and engineers focused on developing generative foundation models for LTX Studio, an AI-based video creation platform. The team's primary goal is to create a controllable, cutting-edge generative video model by integrating advanced algorithms with exceptional engineering. This involves enhancing machine learning components within a sophisticated internal training framework crucial for developing advanced models. The team specializes in research and engineering that enable efficient and scalable training and inference, delivering state-of-the-art AI-generated video models.

As a Large Scale Video Understanding Research Scientist, you will be instrumental in improving video generation quality and efficiency by enhancing the video and audio understanding pipelines used for both training data construction and model evaluation. The role requires hands-on experience with large-scale Video Language Models (VLLMs), including fine-tuning, post-training, and control, and involves implementing classic computer vision and signal processing algorithms. Strong research skills, a solid grasp of statistics, and the ability to implement and debug complex systems are critical, because video training sets comprise petabytes of data processed across hundreds to thousands of virtual machines.

What you will be doing

Fine-tune and control VLLMs for video and audio understanding.
Design algorithms for balancing, filtering, and curating training and evaluation datasets, informed by model behavior and failure modes.
Implement classic and modern algorithms for processing, clustering, evaluation, and filtering of large-scale datasets.
Work within high-performance, scalable distributed systems capable of handling petabytes of data, with attention to throughput, correctness, and reproducibility.
Collaborate with other researchers and product stakeholders to iteratively improve training sets and evaluation protocols through tight feedback loops driven by model performance.

Your skills and experience

Experience training, fine-tuning, or post-training large-scale VLLMs or multimodal foundation models.
Strong software engineering skills and proficiency in JAX or PyTorch.
Ability to develop and implement computer vision models for data filtering and evaluation.
Understanding of relevant topics in statistics and clustering.
Enjoyment of digging into system implementations to improve performance and maintainability.

This role is designed for individuals who are not only technically proficient but also deeply passionate about pushing the boundaries of AI and machine learning through innovative engineering and collaborative research.

Why Join Us

Lightricks aims to push the boundaries of what’s possible with AI and video, focusing on the craft, the challenge, and creating genuinely new solutions. The company fosters an environment where people are encouraged to think, create, and explore, believing that real impact comes from empowerment, experimentation, evolution, and collaboration. At Lightricks, breakthroughs stem from great people and a collaborative mindset. If you seek a place that combines deep tech, creative energy, and a zero-buzzword culture, Lightricks might be the right fit.

Benefits

Daily door-to-door shuttles, Car-to-go subscriptions from various locations in central Israel, plus free parking and train-station pickups.
Two chef-led restaurants on-site by the Machneyuda Group, along with a bakery.
Access to cutting-edge tools and learning opportunities for professional growth, including workshops, platform access and training, subscriptions, and clear guidelines for responsible AI use.
