Research Engineer - Evaluations - AI Safety Institute
The AI Safety Institute is seeking a Research Engineer - Evaluations to build and maintain scientific software, enabling high-quality research on advanced AI systems. This role involves developing and conducting evaluations, driving foundational AI safety research, and facilitating information exchange within a cross-functional team. The Research Engineer will contribute to various workstreams, from chemical/biological misuse to autonomous systems, and play a crucial role in building bespoke infrastructure and tools for research projects.
The AI Safety Institute is the first state-backed organisation focused on advanced AI safety for the public interest. We launched at the AI Safety Summit because we believe taking responsible action on this extraordinary technology requires a capable and empowered group of technical experts within government. Our staff includes senior alumni from OpenAI, Google DeepMind, start-ups and the UK government, and ML professors from Oxford and Cambridge. We are now calling on the world’s top technical talent to build the institute from the ground up. This is a truly unique opportunity to help shape AI safety at an international level. We have ambitious goals and need to move fast.
Research Engineers build and maintain scientific software to enable high-quality research. They are uniquely placed to bridge the worlds of software engineering and research, and at the AI Safety Institute they will be involved in challenging and diverse projects at the cutting edge of advanced AI development.
As a Research Engineer you will either be embedded within one or more of our research teams, or you will sit in a cross-cutting group of Research Engineers within the Platform Team.
In either case, you will collaborate with research scientists and people running evaluations and user studies on the one hand, and with our Platform Team on the other. You might also onboard, run and improve existing evaluations from the wider research community, as well as scale up new evaluation methods developed in-house.
We draw on a wide range of disciplines, and value a diversity of research expertise across our five workstreams. You will be primarily associated with one of our workstreams (please specify in your application which you’re most interested in); however, your work will sometimes intersect multiple workstreams.
The Platform Team will be providing the foundational infrastructure for our research projects. You will build on top of our platform to create bespoke, load-bearing infrastructure and tools for individual research projects. You will be able to independently run and analyse your own experiments to diagnose problems and understand our research work and tech stack in detail.
You will spend your time working not just on infrastructure code but also on the planning and execution of research projects, such as a wide range of evaluations of cutting-edge AI systems. This includes analysing and visualising the outcomes of complex evaluation or fine-tuning procedures, and managing large data sets.
As a Research Engineer, it is your responsibility to make the hard trade-offs between writing code that is load-bearing enough to support multiple experiments and writing “good enough” code to quickly prove or disprove a hypothesis. In this, you will work very closely with our Research Scientists, who will often be the main users of the tools you build.
This role may be a great fit if you:
Alongside your salary of £85,000, the Department for Science, Innovation and Technology contributes £11,473 towards your membership of the Civil Service Defined Benefit Pension scheme. Find out what benefits a Civil Service Pension provides.
The Department for Science, Innovation and Technology offers a competitive mix of benefits including:
Posted June 2, 2026