Research Scientist - Evaluations - AI Safety Institute
As a Research Scientist at the AI Safety Institute, you will help establish robust evaluation frameworks for AI systems and lead projects to assess advanced model capabilities and safeguards. This role involves contributing to foundational AI safety research and collaborating with internal teams and external experts across various workstreams like chem/bio, cyber misuse, and societal impacts.
The AI Safety Institute is the first state-backed organisation focused on advanced AI safety for the public interest. We launched at the AI Safety Summit because we believe taking responsible action on this extraordinary technology requires a capable and empowered group of technical experts within government. Our staff includes senior alumni from OpenAI, Google DeepMind, start-ups and the UK government, and ML professors from Oxford and Cambridge. We are now calling on the world’s top technical talent to build the institute from the ground up. This is a truly unique opportunity to help shape AI safety at an international level.
We have ambitious goals and need to move fast.
As a Research Scientist at AISI, your work will help to set directions for AI system evaluations and to establish robust evaluation frameworks for AI systems. You will lead and contribute to projects designed to be integrated into our evaluation suite, evaluating advanced model capabilities and safeguards, as well as more speculative work aimed at mitigations and system understanding.
We draw on a wide range of disciplines, and value a diversity of research expertise across our five workstreams. You will be primarily associated with one of our workstreams (please specify in your application which you’re most interested in), however, sometimes your work will intersect multiple workstreams.
You will work closely with the Workstream Lead, Research Engineers and other Research Scientists, as well as benefit from support from our cross-functional Platform Team. You will also collaborate with external topic-level experts, contractors, partner organisations and policy makers to coordinate and build on external research.
There will be significant scope to contribute to the strategy of your workstream team and to design experiments with set-ups of increasing complexity.
For this role, you will likely have conducted ML research, research in a domain relevant to your primary workstream, or research at the intersection of your domain and frontier AI systems.
We expect experts in both ML and a specific domain relevant to one of our workstreams to be rare, so we encourage you to apply no matter which research expertise you’re excited to bring to the institute.
We look for some of the following skills, experience and attitudes:
Given the changing nature of the field, it’s most of all important to us to build a team with strong problem-solving skills and a preparedness to acquire any missing knowledge necessary to get the job done.
Owing to the rapid shaping of the field of advanced AI, you will likely be up to date with the latest advancements in advanced AI development.
Posted June 9, 2026