Aleph Alpha

3mo ago

Senior AI Researcher - Reinforcement Learning (m/w/x)

Heidelberg

Full-time, With Home Office, Senior

AI/ML

Nejo AI Summary

Large-scale experiments and code-base maintenance for general-purpose model methodology at an AI lab with 50+ researchers. Proven experience in multi-node LLM training and RL theory is required. Benefits include a Virtual Stock Option Plan and 30 days of vacation.

Requirements

Deep understanding of Reinforcement Learning theory
Experience with multi-node LLM training
Familiarity with statistical evaluation methods
Ability to analyze evaluation environments
Strong Python and ML tooling skills
Willingness to relocate or travel
PhD in RL or equivalent research
Contributions to top-tier RL venues
Experience evaluating LLMs

Tasks

Shape and improve underlying RL methodology
Maintain a high-quality training code-base
Conduct large-scale reinforcement learning experiments
Derive hypotheses from experimental results
Iterate on implementation and methodology
Execute large-scale LLM training runs
Analyze evaluation scores in depth
Propose and implement performance improvements
Maximize performance on internal benchmarks
Identify and implement novel multi-turn RL approaches
Stay current with bleeding-edge RL research
Identify and resolve training infrastructure bottlenecks
Optimize RL loops for large-scale training
Partner with post-training teams on feedback
Convert raw feedback into actionable training signals
Ensure RL iterations improve downstream performance

Work Experience

approx. 4 - 6 years

Education

Doctoral / PhD

Languages

English – Business Fluent

Tools & Technologies

Python
torch.distributed
LLM
ML tooling

Benefits

Flexible Working

Flexible working hours
Hybrid working model

Competitive Pay

Virtual Stock Option Plan

More Vacation Days

30 days paid vacation

Healthcare & Fitness

Fitness & wellness offerings

Mental Health Support

Mental health support

Retirement Plans

Subsidized company pension plan

Public Transport Subsidies

Subsidized transportation ticket

Additional Allowances

Technical equipment budget

Company Bike

JobRad Bike Lease

Find the original job posting in its most current version here. Nejo automatically captured this job from the website of Aleph Alpha and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.
