Research Engineer, Multimodal Reasoning For Information Literacy
As a Research Engineer at Google DeepMind, you will develop and apply multimodal reasoning systems and Vision-Language Models (VLMs) to assess the trustworthiness of online media. This role involves rapid prototyping, designing and training multimodal models, and engaging with product teams to advance research in information literacy.
At Google DeepMind, our research team is dedicated to tackling the most complex challenges in online information quality. We strive to advance the state of the art by developing innovative solutions to detect manipulated media and misleading narratives, helping to protect the integrity of digital discourse. Our interdisciplinary work spans provenance analysis and the creation of tools for AI-assisted information literacy, applying our technologies for broad public benefit and a safer online environment. We thrive in a supportive environment that encourages rapid prototyping and iteration, driving our research achievements directly into Google’s flagship models, including Gemini.
To succeed in this role, you will need to be passionate about advancing information literacy using machine learning and other computational techniques. You'll join an interdisciplinary team of domain experts, ML researchers, and engineers to research and build multimodal reasoning systems and Vision-Language Models (VLMs) to assess the trustworthiness of media (images, audio, and videos) on the internet.
When assessing technical background, we will take a holistic view of the mix of scientific, ML, and computational experience. We do not expect you to be an expert in all fields simultaneously.
Posted May 26, 2026