Intercom

Posted last month

Senior+ AI Infrastructure Engineers (m/f/x)

Berlin

Full-time, On-site, Senior

AI/ML

Nejo AI Summary

Implement training pipelines for transformers and LLMs at an AI customer service company. Low-level GPU coding (CUDA, Triton) is required. The role offers hybrid work, an annual bonus, and equity.

Requirements

Model training or inference at scale
Low-level GPU coding (e.g. CUDA, Triton)
5+ years software engineering experience
Shipping high-quality products or platforms
Degree in Computer Science, Computer Engineering, or related field
Equivalent experience with strong fundamentals
Model training (especially transformers and LLMs)
Model inference at scale (especially transformers and LLMs)
Low-level GPU work (e.g. CUDA or Triton kernels)
Working in production environments at meaningful scale
Clear communication of technical topics
Close collaboration with engineers and non-engineers
Strong technical fundamentals
Love of learning and self-development
Deep knowledge of at least one programming language
Ability to write clean, reliable code
Ability to learn new stacks quickly
Experience at AI-native companies training models or running inference
Experience running training or inference on Kubernetes
Experience with AWS or other major cloud providers
Production experience with Python in ML or infrastructure
Passion for technology (personal projects, open source, etc.)

Responsibilities

Implement training pipelines for large transformer and LLM models
Scale data ingestion and preprocessing processes
Optimize distributed training and evaluation
Build low-latency, high-reliability inference services
Optimize inference services for autoscaling, routing, and fallbacks
Tune GPU kernels for performance
Improve GPU utilization
Identify and resolve bottlenecks in training and inference
Collaborate with ML scientists on cutting-edge methods
Bring advanced training and inference methods to production
Mentor and develop other engineers
Hire new engineers
Raise technical standards
Enhance reliability and operational excellence

Work experience

5 years

Education

Bachelor's degree

Languages

English (business fluent)

Tools & Technologies

CUDA
Triton
Python
Kubernetes
AWS

Benefits

Flexible working

Hybrid working policy
Flexibility to work from home

Bonuses & awards

Annual bonus

Attractive compensation

Equity
Regular compensation reviews

Other benefits

Unlimited access to Claude Code

Modern equipment

Access to best-in-class AI tools
MacBook provided
Windows laptop option

More vacation days

Generous paid time off

Team events & outings

Fun events

The original posting, in its latest version, can be found on the company's website. Nejo collected this job automatically from the Intercom website and prepared the information with the help of AI. Despite careful analysis, individual details may be incomplete or inaccurate. Always check all details against the original posting. Content and copyright of the original posting belong to the hiring company.
