Researcher, Robustness & Safety Training
OpenAI is seeking a Senior Researcher focused on Robustness & Safety Training to advance AI safety research and ensure the secure deployment of AGI systems. This role involves conducting state-of-the-art research in areas like RLHF and adversarial training, implementing safety improvements in products, and collaborating with cross-functional teams to establish high safety standards.
The Safety Systems team is responsible for a range of safety work that ensures our best models can be safely deployed in the real world to benefit society. This team is at the forefront of OpenAI's mission to build and deploy safe AGI, driving our commitment to AI safety and fostering a culture of trust and transparency.
The Model Safety Research team aims to fundamentally advance our capabilities for precisely implementing robust, safe behavior in AI models, and to leverage these advances to make OpenAI’s deployed models safe and beneficial. This requires a breadth of new ML research to address the growing set of safety challenges as AI becomes more powerful and is used in more settings. Key focus areas include how to enforce nuanced safety policies without trading off helpfulness and capabilities, how to make the model robust to adversaries, how to address privacy and security risks, and how to make the model trustworthy in safety-critical domains.
We seek to learn from deployment and distribute the benefits of AI, while ensuring that this powerful tool is used responsibly and safely.
OpenAI is seeking a senior researcher with a passion for AI safety and experience in safety research. In this role, you will set research directions that enable safe AGI and work on research projects to make our AI systems safer, more aligned, and more robust to adversarial or malicious use. You will play a critical role in shaping what a safe AI system should look like in the future at OpenAI, making a significant impact on our mission to build and deploy safe AGI.
Posted June 8, 2026