Intercom

Senior+ AI Infrastructure Engineers (m/w/x)

Berlin
Full-time, On-site, Senior
AI/ML

Implement training pipelines for transformer models and LLMs at an AI customer service company. Low-level GPU coding (CUDA, Triton) is required. The role offers hybrid work, an annual bonus, and equity.

Requirements

  • Model training or inference at scale (especially transformers and LLMs)
  • Low-level GPU coding (e.g. CUDA or Triton kernels)
  • 5+ years software engineering experience
  • Shipping high-quality products or platforms
  • Degree in Computer Science, Computer Engineering, or related field, or equivalent experience with strong fundamentals
  • Working in production environments at meaningful scale
  • Clear communication of technical topics
  • Close collaboration with engineers and non-engineers
  • Strong technical fundamentals
  • Love of learning and self-development
  • Deep knowledge of at least one programming language
  • Ability to write clean, reliable code
  • Ability to learn new stacks quickly
  • Experience at AI native companies training/running inference
  • Experience running training or inference on Kubernetes
  • Experience with AWS or other major cloud providers
  • Production experience with Python in ML or infrastructure
  • Passion for technology (personal projects, open source, etc.)

Tasks

  • Implement training pipelines for large transformer models and LLMs
  • Scale data ingestion and preprocessing
  • Optimize distributed training and evaluation
  • Build low-latency, high-reliability inference services
  • Optimize inference services for autoscaling, routing, and fallbacks
  • Tune GPU kernels for performance
  • Improve GPU utilization
  • Identify and resolve bottlenecks in training and inference
  • Collaborate with ML scientists on cutting-edge methods
  • Bring advanced training and inference methods to production
  • Mentor and develop other engineers
  • Hire new engineers
  • Raise technical standards
  • Enhance reliability and operational excellence

Work Experience

  • 5+ years

Education

  • Bachelor's degree

Languages

  • English: business fluent

Tools & Technologies

  • CUDA
  • Triton
  • Python
  • Kubernetes
  • AWS

Benefits

Flexible Working

  • Hybrid working policy
  • Flexibility to work from home

Bonuses & Incentives

  • Annual bonus

Competitive Pay

  • Equity
  • Regular compensation reviews

Other Benefits

  • Unlimited access to Claude Code

Modern Equipment

  • Access to best-in-class AI tools
  • MacBook provided
  • Windows laptop option

More Vacation Days

  • Generous paid time off

Team Events

  • Fun events

Find the original job posting in its most current version here. Nejo automatically captured this job from the website of Intercom and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.

