Senior AI Researcher - Reinforcement Learning (m/f/x)
Large-scale experiments and codebase maintenance for general-purpose model methodology at an AI lab with 50+ researchers. Proven experience in multi-node LLM training and RL theory required. Virtual Stock Option Plan, 30 days vacation.
Requirements
- Deep understanding of Reinforcement Learning theory
- Experience with multi-node LLM training
- Familiarity with statistical evaluation methods
- Ability to analyze evaluation environments
- Strong Python and ML tooling skills
- Willingness to relocate or travel
- PhD in RL or equivalent research
- Contributions to top-tier RL venues
- Experience evaluating LLMs
Responsibilities
- Shape and improve underlying RL methodology
- Maintain a high-quality training codebase
- Conduct large-scale reinforcement learning experiments
- Derive hypotheses from experimental results
- Iterate on implementation and methodology
- Execute large-scale LLM training runs
- Analyze evaluation scores in depth
- Propose and implement performance improvements
- Maximize performance on internal benchmarks
- Identify and implement novel multi-turn RL approaches
- Stay current with bleeding-edge RL research
- Identify and resolve training infrastructure bottlenecks
- Optimize RL loops for large-scale training
- Partner with post-training teams on feedback
- Convert raw feedback into actionable training signals
- Ensure RL iterations improve downstream performance
Work experience
- approx. 4-6 years
Education
- Doctorate / Ph.D.
Languages
- English (business fluent)
Tools & Technologies
- Python
- torch distributed
- LLM
- ML tooling
Benefits
Flexible working
- Flexible working hours
- Hybrid working model
Attractive compensation
- Virtual Stock Option Plan
More vacation days
- 30 days paid vacation
Health & fitness offerings
- Fitness & wellness offerings
Mental health promotion
- Mental health support
Company pension plan
- Subsidized company pension plan
Public transport tickets
- Subsidized transportation ticket
Other allowances
- Technical equipment budget
Company bike
- JobRad Bike Lease
About the company
Aleph Alpha
Industry
IT
Description
The company develops cutting-edge generative AI solutions with a strong emphasis on sovereignty, ethical development, and societal benefit.
Not quite the right fit?
- Buhl Data Service GmbH: Senior AI / Data Science Engineer (m/f/x). Full-time with home office, Senior, Mannheim
- SAP: Principal Machine Learning Expert / Development Architect (m/f/x). Full-time with home office, Senior, Walldorf
- Aleph Alpha: Senior Performance Engineer - Pretraining (m/f/x). Full-time with home office, Senior, Heidelberg
- ABB AG: (Senior) Scientist – AI and Graphs (m/f/x). Full-time with home office, Experienced, Mannheim
- ABB AG: Senior Scientist – Agentic AI and Applications (m/f/x). Full-time with home office, Senior, Mannheim