Aleph Alpha

Senior AI Researcher - Reinforcement Learning (m/w/x)

Heidelberg
Full-time, With Home Office, Senior
AI/ML

Run large-scale experiments and maintain the code base for general-purpose model methodology at an AI lab with 50+ researchers. Proven experience in multi-node LLM training and RL theory is required. Benefits include a Virtual Stock Option Plan and 30 days of vacation.

Requirements

  • Deep understanding of Reinforcement Learning theory
  • Experience with multi-node LLM training
  • Familiarity with statistical evaluation methods
  • Ability to analyze evaluation environments
  • Strong Python and ML tooling skills
  • Willingness to relocate or travel
  • PhD in RL or equivalent research
  • Contributions to top-tier RL venues
  • Experience evaluating LLMs

Tasks

  • Shape and improve underlying RL methodology
  • Maintain a high-quality training code-base
  • Conduct large-scale reinforcement learning experiments
  • Derive hypotheses from experimental results
  • Iterate on implementation and methodology
  • Execute large-scale LLM training runs
  • Analyze evaluation scores in depth
  • Propose and implement performance improvements
  • Maximize performance on internal benchmarks
  • Identify and implement novel multi-turn RL approaches
  • Stay current with bleeding-edge RL research
  • Identify and resolve training infrastructure bottlenecks
  • Optimize RL loops for large-scale training
  • Partner with post-training teams on feedback
  • Convert raw feedback into actionable training signals
  • Ensure RL iterations improve downstream performance

Work Experience

  • Approx. 4 to 6 years

Education

  • Doctoral / PhD

Languages

  • English: Business Fluent

Tools & Technologies

  • Python
  • torch.distributed
  • LLM
  • ML tooling

Benefits

Flexible Working

  • Flexible working hours
  • Hybrid working model

Competitive Pay

  • Virtual Stock Option Plan

More Vacation Days

  • 30 days paid vacation

Healthcare & Fitness

  • Fitness & wellness offerings

Mental Health Support

  • Mental health support

Retirement Plans

  • Subsidized company pension plan

Public Transport Subsidies

  • Subsidized transportation ticket

Additional Allowances

  • Technical equipment budget

Company Bike

  • JobRad Bike Lease

Nejo automatically captured this job from the Aleph Alpha website and processed the information with the help of AI. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting. Content and copyrights of the original posting belong to the advertising company.
