Research Scientist, Gemini Safety

Remote • Zurich, Switzerland

Posted 3mo ago

Job Location

Zurich, Switzerland

Tech Stack

Fine-Tuning, Reinforcement Learning, JAX, Gemini, LLMs, Supervised Fine-Tuning, Safety, Fairness, Alignment

Remote Work Policy

Fully remote

About the job

The Gemini Safety team at Google DeepMind is responsible for ensuring the safety and fairness of GDM's latest Gemini models. As a Research Scientist, you will apply and develop cutting-edge data and algorithmic solutions to advance these user-facing models. This is a fast-paced, highly collaborative role within a supportive team environment. Our work drives the development of foundational technology adopted by numerous product areas, including the Gemini App, Cloud API, and Search.

Responsibilities

Post-training and instruction tuning state-of-the-art LLMs across text-to-text, image/video/audio-to-text modalities, and agentic capabilities.
Exploring data, reasoning, and algorithmic solutions to ensure Gemini models are safe, maximally helpful, and work for everyone.
Improving Gemini's adversarial robustness, with a focus on high-stakes abuse risks.
Designing and maintaining high-quality evaluation protocols to assess model behavior gaps and headroom related to safety and fairness.
Developing and executing experimental plans to address known gaps or construct new capabilities.
Driving innovation in, and deepening understanding of, supervised fine-tuning and reinforcement learning fine-tuning at scale.

Requirements

PhD in Computer Science or a related field, or equivalent practical experience.
Significant LLM post-training experience.
Experience in reward modeling, reinforcement learning for LLMs, and instruction tuning.
Experience with long-range reinforcement learning.
Experience in areas such as Safety, Fairness, and Alignment.
Track record of publications at NeurIPS, ICLR, ICML.
Experience taking research from concept to product.
Experience collaborating on or leading an applied research project.
Strong experimental taste and good judgment regarding baselines, ablations, and what is worth testing.
Experience with JAX.
