Research Scientist, Reinforcement Learning
As a Research Scientist specializing in Reinforcement Learning, you will lead efforts to develop a deeper understanding of intelligence principles within the MARA project. This role involves advancing model-based RL, exploration strategies, and optimal control, and applying these methods to real-world challenges like AutumnBench and robotics environments. You will conduct independent and collaborative research, develop new algorithms, and disseminate findings through publications and presentations.
Research scientists lead Basis’ efforts to develop a deeper understanding of the conceptual, mathematical, and computational principles of intelligence. We are looking for people who are technically excellent, and who value probing concepts at their foundations. Our research scientists/engineers aspire to do rigorous, high-quality, robust science, but are not afraid to tinker, make mistakes, and explore radically different ideas in order to get there. Basis is a collaborative effort, both internally and with our external partners; we are looking for people who enjoy working with others on problems larger than ones they can tackle alone.
Our research within the MARA project aims to develop new foundations and technologies for modeling, abstraction, and reasoning in AI systems. MARA’s overarching goal is to uncover principled methods for how intelligence constructs, refines, and utilizes world models through interactive experimentation.
For this role, we are specifically looking for experts in Reinforcement Learning & Planning who can advance the state of the art in model-based RL, exploration strategies, optimal control, and Bayesian optimization. You will work on developing agents that can learn efficient policies in complex, partially observable environments by leveraging structured world models.
The immediate mission of MARA is to solve concrete challenges such as AutumnBench, physical and simulated robotics benchmarks, and the Abstract Reasoning Corpus (ARC), with the broader mission of building systems capable of learning in an open, growing portfolio of domains using human-comparable amounts of data and interaction.
Posted June 10, 2026