neoshare AG

22 days ago

Head of AI Engineering (m/f/x)

München, Frankfurt am Main, Berlin

Full-time, home office possible, Senior

AI/ML

Data Science

Nejo AI summary

The role covers productionizing ML models and building unified LLM access for a fintech SaaS product. It requires 4+ years of leading AI/ML engineering in production. Benefits include 30 vacation days and flexible hours.

Requirements

5+ years of backend engineering experience
4+ years of leading AI/ML engineering in production
Ideally 10+ years of total experience
Deep architecture expertise in Java (JVM)
Deep architecture expertise in Node.js (NestJS)
Distributed systems expertise
API expertise
Microservices expertise
Messaging/streaming expertise
Hands-on with LLM orchestration (LangChain/LlamaIndex or custom)
Hands-on with vector DBs (Pinecone, Qdrant, FAISS)
Hands-on with cloud AI (AWS Bedrock)
Experience operating systems at scale (millions of daily API calls) with strong SLOs, observability, and incident management
MLOps foundations: model registries
MLOps foundations: experiment tracking
MLOps foundations: CI/CD
MLOps foundations: Kubernetes
MLOps foundations: IaC (Terraform)
Security best practices
Excellent communication skills
Excellent stakeholder management skills
Strong product sense focused on shipping user-facing features
Experience with GPU/accelerator serving and optimization (vLLM, TGI, Triton, ONNX Runtime)
Cost optimization for LLM workloads (token budgets, dynamic routing, caching)
Evaluation and safety/red-teaming for generative systems
Startup/high-growth experience
Right to work in the EU

Responsibilities

Transform ML team from research-heavy to production-grade
Partner with Director of AI on strategy
Build platform unifying LLM access, RAG, and backend services
Ship reliable, scalable AI features for banking
Hire, mentor, and develop a high-performing team
Set technical standards, operating rhythms, and review practices
Organize sub-teams with clear ownership and SLOs
Manage roadmap, capacity planning, and delivery
Own LLM gateway with unified APIs and proxy layers
Implement rate limits, fallbacks, and cost tracking
Build high-performance RAG pipelines
Ensure robust observability and safety guardrails
Define clean async contracts and eventing patterns
Drive low-latency, scalable inference
Lead end-to-end model and prompt lifecycle
Curate data, train, evaluate, and deploy models
Establish LLMOps/MLOps practices
Implement CI/CD, canary/A/B tests, and drift monitoring
Optimize inference throughput and cost
Translate company goals into AI/ML roadmap
Balance exploration with reliability and cost
Manage build-vs-buy/vendor strategy
Implement data privacy, security, and compliance
Track prompt/model lineage and reproducibility
Define incident response and postmortems

Professional experience

10 to 15 years

Education

Bachelor's degree or Master's degree

Languages

English: fluent

Tools & Technologies

Java
JVM
Node.js
NestJS
LangChain
LlamaIndex
Pinecone
Qdrant
FAISS
AWS Bedrock
Kubernetes
Terraform
vLLM
TGI
Triton
ONNX Runtime

Benefits

Relaxed company culture

International & Inclusive Team
Modern & Dog-friendly Offices

More vacation days

30 vacation days
Half-day off on Christmas Eve
Half-day off on New Year's Eve

Flexible working

Flexible working hours
Hybrid work

Workation & Sabbatical

Workation

Health & fitness offerings

Urban Sports/EGYM Club subsidy

Public transport tickets

50% monthly subsidy for Deutschlandticket

Company bicycle

JobRad bicycle leasing

The most recent version of the original job posting is available on the website of neoshare AG. Nejo automatically captured this job from that website and prepared the information on Nejo with the help of AI. Despite careful analysis, individual details may be incomplete or inaccurate. Always check all details against the original posting. Content and copyright of the original posting belong to the advertising company.
