Head of AI Engineering (m/f/x)
Productionizing ML models and building unified LLM access for fintech SaaS. 4+ years leading AI/ML production engineering required. 30 vacation days, flexible hours.
Requirements
- 5+ years backend engineering experience
- 4+ years leading AI/ML engineering in production
- 10+ years total experience ideal
- Deep architecture expertise in Java (JVM)
- Deep architecture expertise in Node.js (NestJS)
- Distributed systems expertise
- API design expertise
- Microservices expertise
- Messaging/streaming expertise
- Hands-on with LLM orchestration (LangChain/LlamaIndex or custom)
- Hands-on with vector DBs (Pinecone, Qdrant, FAISS)
- Hands-on with cloud AI (AWS Bedrock)
- Operation of systems at scale (millions of daily API calls)
- Strong SLOs
- Strong observability
- Strong incident management
- MLOps foundations: model registries
- MLOps foundations: experiment tracking
- MLOps foundations: CI/CD
- MLOps foundations: Kubernetes
- MLOps foundations: IaC (Terraform)
- Security best practices
- Excellent communication skills
- Excellent stakeholder management skills
- Strong product sense focused on shipping user-facing features
- Experience with GPU/accelerator serving and optimization (vLLM, TGI, Triton, ONNX Runtime)
- Cost optimization for LLM workloads (token budgets, dynamic routing, caching)
- Evaluation and safety/red-teaming for generative systems
- Startup/high-growth experience
- Right to work in the EU
Responsibilities
- Transform ML team from research-heavy to production-grade
- Partner with Director of AI on strategy
- Build platform unifying LLM access, RAG, and backend services
- Ship reliable, scalable AI features for banking
- Hire, mentor, and develop a high-performing team
- Set technical standards, operating rhythms, and review practices
- Organize sub-teams with clear ownership and SLOs
- Manage roadmap, capacity planning, and delivery
- Own LLM gateway with unified APIs and proxy layers
- Implement rate limits, fallbacks, and cost tracking
- Build high-performance RAG pipelines
- Ensure robust observability and safety guardrails
- Define clean async contracts and eventing patterns
- Drive low-latency, scalable inference
- Lead end-to-end model and prompt lifecycle
- Curate data, train, evaluate, and deploy models
- Establish LLMOps/MLOps practices
- Implement CI/CD, canary/A/B tests, and drift monitoring
- Optimize inference throughput and cost
- Translate company goals into AI/ML roadmap
- Balance exploration with reliability and cost
- Manage build-vs-buy/vendor strategy
- Implement data privacy, security, and compliance
- Track prompt/model lineage and reproducibility
- Define incident response and postmortems
Work experience
- 10 to 15 years
Education
- Bachelor's degree OR
- Master's degree
Languages
- English (fluent)
Tools & Technologies
- Java
- JVM
- Node.js
- NestJS
- LangChain
- LlamaIndex
- Pinecone
- Qdrant
- FAISS
- AWS Bedrock
- Kubernetes
- Terraform
- vLLM
- TGI
- Triton
- ONNX Runtime
Benefits
Relaxed company culture
- International & Inclusive Team
- Modern & Dog-friendly Offices
More vacation days
- 30 vacation days
- Half-day off on Christmas Eve
- Half-day off on New Year's Eve
Flexible working
- Flexible working hours
- Hybrid work
Workation & Sabbatical
- Workation
Health & fitness offerings
- Urban Sports/EGYM Club subsidy
Public transport tickets
- 50% monthly subsidy for Deutschlandticket
Company bicycle
- JobRad bicycle leasing
About the company
neoshare AG
Industry
Financial Services
Description
The company is an international fintech that offers innovative end-to-end solutions for digitalizing and managing project and real estate financing.