Skip to content
Neuer Job?Nejo!

Dein persönlicher KI-Karriere-Agent

NEneoshare AG

Head of AI Engineering(m/w/x)

München, Frankfurt am Main, Berlin
Vollzeitmit HomeofficeSenior
AI/ML
Data Science

Productionizing ML models and building unified LLM access for fintech SaaS. 4+ years leading AI/ML production engineering required. 30 vacation days, flexible hours.

Anforderungen

  • 5+ years backend engineering experience
  • 4+ years leading AI/ML engineering in production
  • 10+ years total experience ideal
  • Deep architecture expertise in Java (JVM)
  • Deep architecture expertise in Node.js (NestJS)
  • Distributed systems expertise
  • APIs expertise
  • Microservices expertise
  • Messaging/streaming expertise
  • Hands-on with LLM orchestration (LangChain/LlamaIndex or custom)
  • Hands-on with vector DBs (Pinecone, Qdrant, FAISS)
  • Hands-on with cloud AI (AWS Bedrock)
  • Operation of systems at scale (millions of daily API calls)
  • Strong SLOs
  • Strong observability
  • Strong incident management
  • MLOps foundations: model registries
  • MLOps foundations: experiment tracking
  • MLOps foundations: CI/CD
  • MLOps foundations: Kubernetes
  • MLOps foundations: IaC (Terraform)
  • Security best practices
  • Excellent communication skills
  • Excellent stakeholder management skills
  • Strong product sense focused on shipping user-facing features
  • Experience with GPU/accelerator serving and optimization (vLLM, TGI, Triton, ONNX Runtime)
  • Cost optimization for LLM workloads (token budgets, dynamic routing, caching)
  • Evaluation and safety/red-teaming for generative systems
  • Startup/high-growth experience
  • Right to work in the EU

Aufgaben

  • Transform ML team from research-heavy to production-grade
  • Partner with Director of AI on strategy
  • Build platform unifying LLM access, RAG, and backend services
  • Ship reliable, scalable AI features for banking
  • Hire, mentor, and develop high-performing team
  • Set technical standards, operating rhythms, and review practices
  • Organize sub-teams with clear ownership and SLOs
  • Manage roadmap, capacity planning, and delivery
  • Own LLM gateway with unified APIs and proxy layers
  • Implement rate limits, fallbacks, and cost tracking
  • Build high-performance RAG pipelines
  • Ensure robust observability and safety guardrails
  • Define clean async contracts and eventing patterns
  • Drive low-latency, scalable inference
  • Lead end-to-end model and prompt lifecycle
  • Curate data, train, evaluate, and deploy models
  • Establish LLMOps/MLOps practices
  • Implement CI/CD, canary/A/B tests, and drift monitoring
  • Optimize inference throughput and cost
  • Translate company goals into AI/ML roadmap
  • Balance exploration with reliability and cost
  • Manage build-vs-buy/vendor strategy
  • Implement data privacy, security, and compliance
  • Track prompt/model lineage and reproducibility
  • Define incident response and postmortems

Berufserfahrung

  • 10 - 15 Jahre

Ausbildung

  • Bachelor-AbschlussODER
  • Master-Abschluss

Sprachen

  • Englischfließend

Tools & Technologien

  • Java
  • JVM
  • Node.js
  • NestJS
  • LangChain
  • LlamaIndex
  • Pinecone
  • Qdrant
  • FAISS
  • AWS Bedrock
  • Kubernetes
  • Terraform
  • vLLM
  • TGI
  • Triton
  • ONNX Runtime

Benefits

Lockere Unternehmenskultur

  • International & Inclusive Team
  • Modern & Dog-friendly Offices

Mehr Urlaubstage

  • 30 vacation days
  • Half-day off on Christmas Eve
  • Half-day off on New Year's Eve

Flexibles Arbeiten

  • Flexible working hours
  • Hybrid work

Workation & Sabbatical

  • Workation

Gesundheits- & Fitnessangebote

  • Urban Sports/EGYM Club subsidy

Öffi Tickets

  • 50% monthly subsidy for Deutschlandticket

Firmenfahrrad

  • JobRad bicycle leasing
Die Originalanzeige dieses Stellenangebotes in der aktuellsten Version findest du hier. Nejo hat diesen Job automatisch von der Website des Unternehmens neoshare AG erfasst und die Informationen auf Nejo mit Hilfe von KI für dich aufbereitet. Trotz sorgfältiger Analyse können einzelne Informationen unvollständig oder ungenau sein. Bitte prüfe immer alle Angaben in der Originalanzeige! Inhalte und Urheberrechte der Originalanzeige liegen beim ausschreibenden Unternehmen.

Gefällt dir diese Stelle?

Beta

Dein Career Agent findet täglich ähnliche Jobs für dich.


  • Xentral

    Head of AI(m/w/x)

    VollzeitRemoteSenior
    Augsburg, Berlin, München, Lohne
  • Awin

    Senior AI Engineer(m/w/x)

    Vollzeitmit HomeofficeSenior
    Berlin, Hannover, München
  • VESTIGAS

    (Senior) AI Engineer / MLOps Engineer(m/w/x)

    Vollzeitmit HomeofficeSenior
    München
  • OMMAX

    Lead AI Engineer(m/w/x)

    Vollzeitmit HomeofficeSenior
    München
  • SQUER

    Senior Software Engineer - Applied AI & Data Products(m/w/x)

    Vollzeitmit HomeofficeSenior
    München
Alle 100+ ähnlichen Jobs ansehen

Nejo ist eine KI – Ergebnisse können unvollständig sein oder Fehler enthalten