DeepMind
18d ago

Research Scientist, Frontier (m/w/x)

Zürich
Full-time, On-site, Experienced
AI/ML
Data Science

Description

In this role, you will lead the development of cutting-edge post-training strategies for AI models, focusing on enhancing reasoning and instruction-following capabilities. You will collaborate across teams to ensure high-quality performance across different modalities.

Requirements

  • PhD in machine learning, artificial intelligence, or computer science, or equivalent practical experience
  • Strong background in Large Language Models, Reinforcement Learning, or preference learning
  • Research interest in aligning AI systems with human feedback and utility
  • Familiarity with experiment design and analyzing large-scale user data
  • Strong coding and communication skills
  • Experience with RLHF or DPO
  • Experience building or improving reward models and conducting human evaluation studies
  • Proven track record of publications in top-tier conferences
  • Experience with Chain-of-Thought reasoning research or process-based supervision
  • Deep understanding of, and experience with, training models from scratch or using self-play/self-improvement techniques

Education

Doctoral / PhD

Work Experience

approx. 1 - 4 years

Tasks

  • Design and validate novel post-training pipelines for frontier-class models
  • Lead research into next-generation Reward Models
  • Investigate new architectures for Reward Modeling
  • Reduce reward hacking in preference data
  • Improve signal-to-noise ratios in preference data
  • Develop innovative methods to enhance internal reasoning capabilities
  • Focus on correctness and logic in multi-step tasks
  • Revamp and optimize RL prompts and feedback mechanisms
  • Create robust mechanisms to convert user signals into training data
  • Collaborate across teams to apply advanced recipes to various model sizes and modalities

Languages

English: Business Fluent

The original job posting, in its most current version, is available on the DeepMind website. Nejo automatically captured this job from the website of DeepMind and processed the information with the help of AI. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting. Content and copyrights of the original posting belong to the advertising company.