Skip to content
New Job?Nejo!

The AI Job Search Engine

NE
Nebius
7h ago

Senior ML Engineer - Token Factory(m/w/x)

Berlin
Full-timeWith Home OfficeSenior
AI/ML
Data Science

Description

You will push foundation models to their hardware limits by optimizing inference and training pipelines across a massive GPU cloud to maximize throughput and minimize latency.

Let AI find the perfect jobs for you!

Upload your CV and Nejo AI will find matching job offers for you.

Requirements

  • Understanding of machine learning foundations
  • Experience profiling GPU workloads
  • Understanding of GPU memory hierarchy
  • Familiarity with LLM architectures
  • Understanding of neural network training
  • Strong software engineering skills
  • Experience with deep learning frameworks
  • Proficiency in CI/CD and versioning
  • Strong communication and leadership abilities
  • Experience with open-source inference engines
  • Experience with kernel languages
  • Track record of delivering products
  • Experience developing large distributed systems
  • Open-source projects showcasing engineering prowess
  • Excellent command of English language

Education

Bachelor's degree
OR
Master's degree
OR
Doctoral / PhD

Work Experience

approx. 4 - 6 years

Tasks

  • Identify LLM inference bottlenecks
  • Drive production speedups
  • Maximize performance for LLM architectures
  • Support and optimize inference engines
  • Implement novel speculative decoding architectures
  • Optimize dense and MoE components
  • Contribute to open-source inference engines
  • Design low-precision training pipelines
  • Productionize FP8 and NVFP4 inference
  • Improve throughput and cost-efficiency

Tools & Technologies

NsightPyTorch profilerPythonCI/CDvLLMSGLangTensorRT-LLMTritonCuteCUTLASSCUDA

Languages

EnglishBusiness Fluent

Benefits

Flexible Working

  • Flexible working arrangements

Competitive Pay

  • Competitive salary

Other Benefits

  • Comprehensive benefits package

Career Advancement

  • Professional growth opportunities

Informal Culture

  • Dynamic and collaborative work environment
Find the original job posting in its most current version here. Nejo automatically captured this job from the website of Nebius and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.
Not a perfect match?
100+ Similar Jobs in Berlin
  • FactoryPal

    Senior Machine Learning Engineer(m/w/x)

    Full-timeWith HomeofficeSenior
    Berlin
  • acto

    Senior/Staff AI Engineer - Agentic Systems(m/w/x)

    Full-timeWith HomeofficeSenior
    Berlin, München
  • Super.AI

    Machine Learning Engineer(m/w/x)

    Full-timeWith HomeofficeExperienced
    Berlin
  • AUTO1 Group

    Senior Machine Learning Platform/Ops Engineer(m/w/x)

    Full-timeWith HomeofficeSenior
    Berlin
  • Smartly

    Senior Machine Learning Engineer - AI Platform(m/w/x)

    Full-timeWith HomeofficeSenior
    Berlin
100+ View all similar jobs