Senior+ AI Infrastructure Engineers (m/f/x)
Implement training pipelines for transformer and LLM models at an AI customer service company. Low-level GPU coding (CUDA, Triton) is required. The role offers hybrid work, an annual bonus, and equity.
Requirements
- Model training or inference at scale
- Low-level GPU coding (e.g. CUDA, Triton)
- 5+ years software engineering experience
- Shipping high-quality products or platforms
- Degree in Computer Science, Computer Engineering, or related field
- Equivalent experience with strong fundamentals
- Model training (especially transformers and LLMs)
- Model inference at scale (especially transformers and LLMs)
- Low-level GPU work (e.g. CUDA or Triton kernels)
- Working in production environments at meaningful scale
- Clear communication of technical topics
- Close collaboration with engineers and non-engineers
- Strong technical fundamentals
- Love of learning and self-development
- Deep knowledge of at least one programming language
- Ability to write clean, reliable code
- Ability to learn new stacks quickly
- Experience at AI native companies training/running inference
- Experience running training or inference on Kubernetes
- Experience with AWS or other major cloud providers
- Production experience with Python in ML or infrastructure
- Passion for technology (personal projects, open source, etc.)
Responsibilities
- Implement training pipelines for large transformer and LLM models
- Scale data ingestion and preprocessing processes
- Optimize distributed training and evaluation
- Build low-latency, high-reliability inference services
- Optimize inference services for autoscaling, routing, and fallbacks
- Tune GPU kernels for performance
- Improve GPU utilization
- Identify and resolve bottlenecks in training and inference
- Collaborate with ML scientists on cutting-edge methods
- Bring advanced training and inference methods to production
- Mentor and develop other engineers
- Hire new engineers
- Raise technical standards
- Enhance reliability and operational excellence
Work experience
- 5 years
Education
- Bachelor's degree
Languages
- English (business fluent)
Tools & Technologies
- CUDA
- Triton
- Python
- Kubernetes
- AWS
Benefits
Flexible working
- Hybrid working policy
- Flexibility to work from home
Bonuses & rewards
- Annual bonus
Attractive compensation
- Equity
- Regular compensation reviews
Other benefits
- Unlimited access to Claude Code
Modern equipment
- Access to best-in-class AI tools
- MacBook provided
- Windows laptop option
More vacation days
- Generous paid time off
Team events & outings
- Fun events
About the company
Intercom
Industry
IT
Description
Intercom is the AI Customer Service company on a mission to help businesses provide incredible customer experiences.