The AI Job Search Engine
Senior ML Engineer - Token Factory(m/w/x)
Description
You will push foundation models to their hardware limits by optimizing inference and training pipelines across a massive GPU cloud to maximize throughput and minimize latency.
Let AI find the perfect jobs for you!
Upload your CV and Nejo AI will find matching job offers for you.
Requirements
- •Understanding of machine learning foundations
- •Experience profiling GPU workloads
- •Understanding of GPU memory hierarchy
- •Familiarity with LLM architectures
- •Understanding of neural network training
- •Strong software engineering skills
- •Experience with deep learning frameworks
- •Proficiency in CI/CD and versioning
- •Strong communication and leadership abilities
- •Experience with open-source inference engines
- •Experience with kernel languages
- •Track record of delivering products
- •Experience developing large distributed systems
- •Open-source projects showcasing engineering prowess
- •Excellent command of English language
Education
Work Experience
approx. 4 - 6 years
Tasks
- •Identify LLM inference bottlenecks
- •Drive production speedups
- •Maximize performance for LLM architectures
- •Support and optimize inference engines
- •Implement novel speculative decoding architectures
- •Optimize dense and MoE components
- •Contribute to open-source inference engines
- •Design low-precision training pipelines
- •Productionize FP8 and NVFP4 inference
- •Improve throughput and cost-efficiency
Tools & Technologies
Languages
English – Business Fluent
Benefits
Flexible Working
- •Flexible working arrangements
Competitive Pay
- •Competitive salary
Other Benefits
- •Comprehensive benefits package
Career Advancement
- •Professional growth opportunities
Informal Culture
- •Dynamic and collaborative work environment
- FactoryPalFull-timeWith HomeofficeSeniorBerlin
- acto
Senior/Staff AI Engineer - Agentic Systems(m/w/x)
Full-timeWith HomeofficeSeniorBerlin, München - Super.AI
Machine Learning Engineer(m/w/x)
Full-timeWith HomeofficeExperiencedBerlin - AUTO1 Group
Senior Machine Learning Platform/Ops Engineer(m/w/x)
Full-timeWith HomeofficeSeniorBerlin - Smartly
Senior Machine Learning Engineer - AI Platform(m/w/x)
Full-timeWith HomeofficeSeniorBerlin
Senior ML Engineer - Token Factory(m/w/x)
The AI Job Search Engine
Description
You will push foundation models to their hardware limits by optimizing inference and training pipelines across a massive GPU cloud to maximize throughput and minimize latency.
Let AI find the perfect jobs for you!
Upload your CV and Nejo AI will find matching job offers for you.
Requirements
- •Understanding of machine learning foundations
- •Experience profiling GPU workloads
- •Understanding of GPU memory hierarchy
- •Familiarity with LLM architectures
- •Understanding of neural network training
- •Strong software engineering skills
- •Experience with deep learning frameworks
- •Proficiency in CI/CD and versioning
- •Strong communication and leadership abilities
- •Experience with open-source inference engines
- •Experience with kernel languages
- •Track record of delivering products
- •Experience developing large distributed systems
- •Open-source projects showcasing engineering prowess
- •Excellent command of English language
Education
Work Experience
approx. 4 - 6 years
Tasks
- •Identify LLM inference bottlenecks
- •Drive production speedups
- •Maximize performance for LLM architectures
- •Support and optimize inference engines
- •Implement novel speculative decoding architectures
- •Optimize dense and MoE components
- •Contribute to open-source inference engines
- •Design low-precision training pipelines
- •Productionize FP8 and NVFP4 inference
- •Improve throughput and cost-efficiency
Tools & Technologies
Languages
English – Business Fluent
Benefits
Flexible Working
- •Flexible working arrangements
Competitive Pay
- •Competitive salary
Other Benefits
- •Comprehensive benefits package
Career Advancement
- •Professional growth opportunities
Informal Culture
- •Dynamic and collaborative work environment
About the Company
Nebius
Industry
IT
Description
The company is leading a new era in cloud computing to serve the global AI economy by creating tools and resources for real-world challenges.
- FactoryPal
Senior Machine Learning Engineer(m/w/x)
Full-timeWith HomeofficeSeniorBerlin - acto
Senior/Staff AI Engineer - Agentic Systems(m/w/x)
Full-timeWith HomeofficeSeniorBerlin, München - Super.AI
Machine Learning Engineer(m/w/x)
Full-timeWith HomeofficeExperiencedBerlin - AUTO1 Group
Senior Machine Learning Platform/Ops Engineer(m/w/x)
Full-timeWith HomeofficeSeniorBerlin - Smartly
Senior Machine Learning Engineer - AI Platform(m/w/x)
Full-timeWith HomeofficeSeniorBerlin