Your personal AI career agent
Senior+ AI Infrastructure Engineers(m/w/x)
Implementing training pipelines for transformer and LLM models at AI customer service company. Low-level GPU coding (CUDA, Triton) required. Hybrid work, annual bonus, and equity.
Requirements
- Model training or inference at scale
- Low-level GPU coding (e.g. CUDA, Triton)
- 5+ years software engineering experience
- Shipping high-quality products or platforms
- Degree in Computer Science, Computer Engineering, or related field
- Equivalent experience with strong fundamentals
- Model training (especially transformers and LLMs)
- Model inference at scale (especially transformers and LLMs)
- Low-level GPU work (e.g. CUDA or Triton kernels)
- Working in production environments at meaningful scale
- Clear communication of technical topics
- Close collaboration with engineers and non-engineers
- Strong technical fundamentals
- Love of learning and self-development
- Deep knowledge of at least one programming language
- Ability to write clean, reliable code
- Ability to learn new stacks quickly
- Experience at AI native companies training/running inference
- Experience running training or inference on Kubernetes
- Experience with AWS or other major cloud providers
- Production experience with Python in ML or infrastructure
- Passion for technology (personal projects, open source, etc.)
Tasks
- Implement training pipelines for large transformer and LLM models
- Scale data ingestion and preprocessing processes
- Optimize distributed training and evaluation
- Build low-latency, high-reliability inference services
- Optimize inference services for autoscaling, routing, and fallbacks
- Tune GPU kernels for performance
- Improve GPU utilization
- Identify and resolve bottlenecks in training and inference
- Collaborate with ML scientists on cutting-edge methods
- Bring advanced training and inference methods to production
- Mentor and develop other engineers
- Hire new engineers
- Raise technical standards
- Enhance reliability and operational excellence
Work Experience
- 5 years
Education
- Bachelor's degree
Languages
- English – Business Fluent
Tools & Technologies
- CUDA
- Triton
- Python
- Kubernetes
- AWS
Benefits
Flexible Working
- Hybrid working policy
- Flexibility to work from home
Bonuses & Incentives
- Annual bonus
Competitive Pay
- Equity
- Regular compensation reviews
Other Benefits
- Unlimited access to Claude Code
Modern Equipment
- Access to best-in-class AI tools
- MacBook provided
- Windows laptop option
More Vacation Days
- Generous paid time off
Team Events
- Fun events
Not a perfect match?
- Prior LabsFull-timeOn-siteSeniorFreiburg im Breisgau, Berlin
- Helsing
AI Research Engineer - ML Engineering(m/w/x)
Full-timeOn-siteExperiencedBerlin, München - Langdock
Engineering Department(m/w/x)
Full-timeOn-siteNot specifiedBerlinfrom 140,000 / year - SumUp
Senior AI Backend Engineer(m/w/x)
Full-timeOn-siteSeniorBerlin - Prior Labs
ML Engineer, Cloud Platform(m/w/x)
Full-timeOn-siteExperiencedBerlin, Freiburg im Breisgaufrom 140,000 / year
Senior+ AI Infrastructure Engineers(m/w/x)
Implementing training pipelines for transformer and LLM models at AI customer service company. Low-level GPU coding (CUDA, Triton) required. Hybrid work, annual bonus, and equity.
Requirements
- Model training or inference at scale
- Low-level GPU coding (e.g. CUDA, Triton)
- 5+ years software engineering experience
- Shipping high-quality products or platforms
- Degree in Computer Science, Computer Engineering, or related field
- Equivalent experience with strong fundamentals
- Model training (especially transformers and LLMs)
- Model inference at scale (especially transformers and LLMs)
- Low-level GPU work (e.g. CUDA or Triton kernels)
- Working in production environments at meaningful scale
- Clear communication of technical topics
- Close collaboration with engineers and non-engineers
- Strong technical fundamentals
- Love of learning and self-development
- Deep knowledge of at least one programming language
- Ability to write clean, reliable code
- Ability to learn new stacks quickly
- Experience at AI native companies training/running inference
- Experience running training or inference on Kubernetes
- Experience with AWS or other major cloud providers
- Production experience with Python in ML or infrastructure
- Passion for technology (personal projects, open source, etc.)
Tasks
- Implement training pipelines for large transformer and LLM models
- Scale data ingestion and preprocessing processes
- Optimize distributed training and evaluation
- Build low-latency, high-reliability inference services
- Optimize inference services for autoscaling, routing, and fallbacks
- Tune GPU kernels for performance
- Improve GPU utilization
- Identify and resolve bottlenecks in training and inference
- Collaborate with ML scientists on cutting-edge methods
- Bring advanced training and inference methods to production
- Mentor and develop other engineers
- Hire new engineers
- Raise technical standards
- Enhance reliability and operational excellence
Work Experience
- 5 years
Education
- Bachelor's degree
Languages
- English – Business Fluent
Tools & Technologies
- CUDA
- Triton
- Python
- Kubernetes
- AWS
Benefits
Flexible Working
- Hybrid working policy
- Flexibility to work from home
Bonuses & Incentives
- Annual bonus
Competitive Pay
- Equity
- Regular compensation reviews
Other Benefits
- Unlimited access to Claude Code
Modern Equipment
- Access to best-in-class AI tools
- MacBook provided
- Windows laptop option
More Vacation Days
- Generous paid time off
Team Events
- Fun events
About the Company
Intercom
Industry
IT
Description
Intercom is the AI Customer Service company on a mission to help businesses provide incredible customer experiences.
Not a perfect match?
- Prior Labs
Senior ML Infrastructure Engineer(m/w/x)
Full-timeOn-siteSeniorFreiburg im Breisgau, Berlin - Helsing
AI Research Engineer - ML Engineering(m/w/x)
Full-timeOn-siteExperiencedBerlin, München - Langdock
Engineering Department(m/w/x)
Full-timeOn-siteNot specifiedBerlinfrom 140,000 / year - SumUp
Senior AI Backend Engineer(m/w/x)
Full-timeOn-siteSeniorBerlin - Prior Labs
ML Engineer, Cloud Platform(m/w/x)
Full-timeOn-siteExperiencedBerlin, Freiburg im Breisgaufrom 140,000 / year