New Job?Nejo!

Your personal AI career agent

INIntercom

last mo.

Senior+ AI Infrastructure Engineers(m/w/x)

Berlin

Full-timeOn-siteSenior

AI/ML

Nejo AI Summary

Apply now

Implementing training pipelines for transformer and LLM models at AI customer service company. Low-level GPU coding (CUDA, Triton) required. Hybrid work, annual bonus, and equity.

Requirements

Model training or inference at scale
Low-level GPU coding (e.g. CUDA, Triton)
5+ years software engineering experience
Shipping high-quality products or platforms
Degree in Computer Science, Computer Engineering, or related field
Equivalent experience with strong fundamentals
Model training (especially transformers and LLMs)
Model inference at scale (especially transformers and LLMs)
Low-level GPU work (e.g. CUDA or Triton kernels)
Working in production environments at meaningful scale
Clear communication of technical topics
Close collaboration with engineers and non-engineers
Strong technical fundamentals
Love of learning and self-development
Deep knowledge of at least one programming language
Ability to write clean, reliable code
Ability to learn new stacks quickly
Experience at AI native companies training/running inference
Experience running training or inference on Kubernetes
Experience with AWS or other major cloud providers
Production experience with Python in ML or infrastructure
Passion for technology (personal projects, open source, etc.)

Tasks

Implement training pipelines for large transformer and LLM models
Scale data ingestion and preprocessing processes
Optimize distributed training and evaluation
Build low-latency, high-reliability inference services
Optimize inference services for autoscaling, routing, and fallbacks
Tune GPU kernels for performance
Improve GPU utilization
Identify and resolve bottlenecks in training and inference
Collaborate with ML scientists on cutting-edge methods
Bring advanced training and inference methods to production
Mentor and develop other engineers
Hire new engineers
Raise technical standards
Enhance reliability and operational excellence

Work Experience

5 years

Education

Bachelor's degree

Languages

English – Business Fluent

Tools & Technologies

CUDA
Triton
Python
Kubernetes
AWS

Benefits

Flexible Working

Hybrid working policy
Flexibility to work from home

Bonuses & Incentives

Annual bonus

Competitive Pay

Equity
Regular compensation reviews

Other Benefits

Unlimited access to Claude Code

Modern Equipment

Access to best-in-class AI tools
MacBook provided
Windows laptop option

More Vacation Days

Generous paid time off

Team Events

Fun events

Find the original job posting in its most current version here. Nejo automatically captured this job from the website of Intercom and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.

Like this job?

Beta

Your Career Agent finds similar jobs for you every day.

Not a perfect match?

100+ Similar Jobs in Berlin View all

Prior Labs
Senior ML Infrastructure Engineer(m/w/x)
Full-timeOn-siteSenior
Freiburg im Breisgau, Berlin
Helsing
AI Research Engineer - ML Engineering(m/w/x)
Full-timeOn-siteExperienced
Berlin, München
Langdock
Engineering Department(m/w/x)
Full-timeOn-siteNot specified
Berlin
from 140,000 / year
SumUp
Senior AI Backend Engineer(m/w/x)
Full-timeOn-siteSenior
Berlin
Prior Labs
ML Engineer, Cloud Platform(m/w/x)
Full-timeOn-siteExperienced
Berlin, Freiburg im Breisgau
from 140,000 / year

View all 100+ similar jobs

INIntercom

last mo.