New Job? Nejo!

The AI Job Search Engine

DeepMind

18d ago

Research Scientist, Frontier (m/w/x)

Zürich

Full-time, On-site, Experienced

AI/ML

Data Science

Description

In this role, you will lead the development of cutting-edge post-training strategies for AI models, focusing on enhancing reasoning and instruction-following capabilities. You will collaborate across teams to ensure high-quality performance across different modalities.

Requirements

• PhD in machine learning, artificial intelligence, or computer science, or equivalent practical experience
• Strong background in Large Language Models, Reinforcement Learning, or preference learning
• Research interest in aligning AI systems with human feedback and utility
• Familiarity with experiment design and analyzing large-scale user data
• Strong coding and communication skills
• Experience with RLHF or DPO
• Experience building or improving reward models and conducting human evaluation studies
• Proven track record of publications in top-tier conferences
• Experience with Chain-of-Thought reasoning research or process-based supervision
• Deep understanding of, and experience with, training models from scratch or using self-play/self-improvement techniques

Education

Doctoral / PhD

Work Experience

approx. 1–4 years

Tasks

• Design and validate novel post-training pipelines for frontier-class models
• Lead research into next-generation Reward Models
• Investigate new architectures for Reward Modeling
• Reduce reward hacking in preference data
• Improve signal-to-noise ratios in preference data
• Develop innovative methods to enhance internal reasoning capabilities
• Focus on correctness and logic in multi-step tasks
• Revamp and optimize RL prompts and feedback mechanisms
• Create robust mechanisms to convert user signals into training data
• Collaborate across teams to apply advanced recipes to various model sizes and modalities

Languages

English – Business Fluent

Find the original job posting in its most current version here. Nejo automatically captured this job from the website of DeepMind and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.

100+ Similar Jobs in Zürich

• DeepMind: Research Engineer, Multimodal Reinforcement Learning (m/w/x); Full-time, On-site, Experienced; Zürich
• Lakera: Senior Research Engineer - Security Foundation Models (m/w/x); Full-time, On-site, Senior; Zürich
• Intrinsic: Research Scientist, Deep Learning (m/w/x); Full-time, On-site, Experienced; Zürich
• Lakera: Research Internship (m/w/x); Full-time, Internship, On-site; Zürich
• NVIDIA Switzerland AG: Research Scientist, ML Systems - PhD New College Grad (m/w/x); Full-time, On-site, Experienced; Zürich

About the Company

DeepMind

Description

The company advances the state of the art in artificial intelligence for public benefit and scientific discovery.
