Skip to content
New Job?Nejo!

The AI Job Search Engine

ALAleph Alpha

Senior AI Researcher- Reinforcement learning(m/w/x)

Heidelberg
Full-timeWith Home OfficeSenior
AI/ML

Large-scale experiments and code-base maintenance for general-purpose model methodology at AI lab with 50+ researchers. Proven experience in multi-node LLM training and RL theory required. Virtual Stock Option Plan, 30 days vacation.

Requirements

  • Deep understanding of Reinforcement Learning theory
  • Experience with multi-node LLM training
  • Familiarity with statistical evaluation methods
  • Ability to analyze evaluation environments
  • Strong Python and ML tooling skills
  • Willingness to relocate or travel
  • PhD in RL or equivalent research
  • Contributions to top-tier RL venues
  • Experience evaluating LLM models

Tasks

  • Shape and improve underlying RL methodology
  • Maintain a high-quality training code-base
  • Conduct large-scale reinforcement learning experiments
  • Derive hypotheses from experimental results
  • Iterate on implementation and methodology
  • Execute large-scale LLM training runs
  • Analyze evaluation scores in depth
  • Propose and implement performance improvements
  • Maximize performance on internal benchmarks
  • Identify and implement novel multi-turn RL approaches
  • Stay current with bleeding-edge RL research
  • Identify and resolve training infrastructure bottlenecks
  • Optimize RL loops for large-scale training
  • Partner with post-training teams on feedback
  • Convert raw feedback into actionable training signals
  • Ensure RL iterations improve downstream performance

Work Experience

approx. 4 - 6 years

Education

Doctoral / PhD

Languages

EnglishBusiness Fluent

Tools & Technologies

Pythontorch distributedLLMML tooling

Benefits

Flexible Working

  • Flexible working hours
  • Hybrid working model

Competitive Pay

  • Virtual Stock Option Plan

More Vacation Days

  • 30 days paid vacation

Healthcare & Fitness

  • Fitness & wellness offerings

Mental Health Support

  • Mental health support

Retirement Plans

  • Subsidized company pension plan

Public Transport Subsidies

  • Subsidized transportation ticket

Additional Allowances

  • Technical equipment budget

Company Bike

  • JobRad Bike Lease
Find the original job posting in its most current version here. Nejo automatically captured this job from the website of Aleph Alpha and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.
Not a perfect match?
100+ Similar Jobs in Heidelberg
  • Buhl Data Service GmbH

    Senior AI / Data Science Engineer(m/w/x)

    Full-timeWith HomeofficeSenior
    Mannheim
  • Aleph Alpha

    Senior Performance Engineer- Pretraining(m/w/x)

    Full-timeWith HomeofficeSenior
    Heidelberg
  • SAP

    Principal Machine Learning Expert/ Development Architect(m/w/x)

    Full-timeWith HomeofficeSenior
    Walldorf
  • ABB AG

    Senior Scientist – Agentic AI and Applications(m/w/x)

    Full-timeWith HomeofficeSenior
    Mannheim
  • ABB AG

    (Senior) Scientist – AI and Graphs(m/w/x)

    Full-timeWith HomeofficeExperienced
    Mannheim
100+ View all similar jobs

Nejo is an AI – results may be incomplete or contain mistakes