Skip to content
New Job?Nejo!

Your personal AI career agent

ALAleph Alpha

Senior AI Software Engineer - Model Evaluation(m/w/x)

Heidelberg
Full-timeWith Home OfficeSenior
AI/ML

Designing and implementing evaluation methodologies for foundational AI models in industrial applications. LLM evaluation, benchmark design, and Python skills required. 30 days vacation, hybrid work, and wellness offerings.

Requirements

  • Experience with LLM evaluation, benchmark design, dataset curation, and experimental design
  • Familiarity with statistical methods for evaluation and experiment design
  • Track record of shipping impactful technical work (research, infrastructure, or both)
  • Strong Python skills and comfort with ML tooling
  • Ability to reason about evaluation measures and their relevance
  • Ownership mentality: problem diagnosis to solution deployment
  • Willingness to relocate to Heidelberg or travel regularly
  • Understanding of foundation model training (data, scale, architecture effects)
  • Experience with large-scale data processing or ML infrastructure
  • German language proficiency (helpful for evaluating German capabilities, not required)
  • PhD in machine learning, NLP, statistics, or related field (valued but not required)

Tasks

  • Design and implement evaluation methodologies
  • Select and maintain evaluation datasets
  • Develop scoring infrastructure for pre-training
  • Optimize evaluation pipelines for speed and reliability
  • Build tools for benchmark result interpretation
  • Identify and address model capability gaps
  • Create or integrate new benchmarks
  • Ensure rigorous German language evaluation
  • Correlate pre-training metrics with performance outcomes

Work Experience

  • approx. 4 - 6 years

Education

  • Doctoral / PhD

Languages

  • GermanBasic

Tools & Technologies

  • Python
  • PyTorch
  • LLM evaluation
  • ML tooling
  • evaluation frameworks
  • distributed systems
  • foundation model training
  • large-scale data processing
  • ML infrastructure

Benefits

Flexible Working

  • Flexible working hours
  • Hybrid working model

More Vacation Days

  • 30 days of paid vacation

Healthcare & Fitness

  • Access to fitness & wellness offerings

Mental Health Support

  • Mental health support

Retirement Plans

  • Subsidized company pension plan

Public Transport Subsidies

  • Subsidized Germany-wide transportation ticket

Additional Allowances

  • Budget for additional technical equipment

Competitive Pay

  • Virtual Stock Option Plan

Company Bike

  • Bike Lease
Find the original job posting in its most current version here. Nejo automatically captured this job from the website of Aleph Alpha and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.

  • Aleph Alpha

    Senior AI Researcher - Pre-training Data(m/w/x)

    Full-timeWith HomeofficeSenior
    Heidelberg
  • Aleph Alpha

    Senior AI Engineer – Pre-training Data(m/w/x)

    Full-timeWith HomeofficeSenior
    Heidelberg
  • Aleph Alpha

    Senior Performance Engineer- Pretraining(m/w/x)

    Full-timeWith HomeofficeSenior
    Heidelberg
  • Buhl Data Service GmbH

    Senior AI / Data Science Engineer(m/w/x)

    Full-timeWith HomeofficeSenior
    Mannheim
  • Aleph Alpha

    Senior AI Researcher- Reinforcement learning(m/w/x)

    Full-timeWith HomeofficeSenior
    Heidelberg
View all 100+ similar jobs

Nejo is an AI – results may be incomplete or contain mistakes