Research Engineer, Data
As a Research Engineer - Data, you will be responsible for building and driving the data foundation for Periodic Labs' research efforts. This involves owning the end-to-end data strategy, from sourcing and procuring external datasets to integrating internal experimental data, ensuring researchers have the right data for training and improving frontier models. You will work at the intersection of data engineering, research infrastructure, and strategy, collaborating with ML researchers to build data pipelines and systems.
Periodic Labs is an AI and physical sciences company building state-of-the-art models to accelerate breakthroughs across materials, energy, and beyond. Backed by world-class investors and growing rapidly, we operate at the pace the frontier requires. Our team brings deep expertise, genuine ownership, and an insatiable drive to push the boundaries of what’s scientifically possible.
You will build and drive the data foundation for our research efforts. This means owning data strategy end-to-end: sourcing and procuring external datasets, integrating internally generated experimental data into the training stack, and ensuring the team always has the right data — in the right shape — to train and improve frontier models.
This role sits at the intersection of data engineering, research infrastructure, and strategy. You will work closely with pretraining, midtraining, and RL researchers to understand what data the models need, then build the pipelines and systems to get it there. The work spans collecting and organizing diverse data sources, improving data quality through deduplication and preprocessing, and ensuring that new experimental results are incorporated in a structured, repeatable way that makes them useful for model development.
Posted June 11, 2026