Dein persönlicher KI-Karriere-Agent
Senior Deep Learning Engineer(m/w/x)
Improving inference speed and deploying Cosmos WFMs on GPU platforms for AI/HPC breakthroughs. MSc/PhD in CS/EE or equivalent, with Python/PyTorch expertise, essential. Work on cutting-edge AI/HPC projects.
Anforderungen
- 8+ years of experience
- MSc or PhD in CS, EE, CSEE or equivalent experience
- Strong background in Deep Learning
- Strong programming skills in Python and PyTorch
- Experience with inference optimization techniques
- Experience with inference optimization frameworks: TensorRT, TensorRT-LLM, vLLM, SGLang
- Familiarity with deploying Deep Learning models in production settings
- CUDA programming experience
- Familiarity with diffusion models
- Proven experience in analyzing, modeling, and tuning GPU workloads
Aufgaben
- Improve inference speed for Cosmos WFMs on GPU platforms
- Deploy Cosmos WFMs into production
- Profile and analyze deep learning workloads to identify bottlenecks
- Remove bottlenecks in deep learning processes
Berufserfahrung
- 8 Jahre
Ausbildung
- Master-Abschluss
Sprachen
- Englisch – verhandlungssicher
Tools & Technologien
- Python
- PyTorch
- TensorRT
- TensorRT-LLM
- vLLM
- SGLang
- Docker
- Triton Inference Server
- CUDA
Gefällt dir diese Stelle?
BetaDein Career Agent findet täglich ähnliche Jobs für dich.
Noch nicht perfekt?
- Analog Devices, Inc.Vollzeitmit HomeofficeSeniorMünchen
- NVIDIA Germany
Senior Solutions Architect, AI Factory(m/w/x)
Vollzeitmit HomeofficeSeniorMünchen - Axelera AI
Senior/Staff Applications Engineer - Embedded AI(m/w/x)
VollzeitRemoteBerufserfahrenZürich, München - OMMAX
(Senior) AI Engineer(m/w/x)
Vollzeitmit HomeofficeBerufserfahrenMünchen - OMMAX
(Senior) MLOps Engineer(m/w/x)
Vollzeitmit HomeofficeBerufserfahrenMünchen
Senior Deep Learning Engineer(m/w/x)
Improving inference speed and deploying Cosmos WFMs on GPU platforms for AI/HPC breakthroughs. MSc/PhD in CS/EE or equivalent, with Python/PyTorch expertise, essential. Work on cutting-edge AI/HPC projects.
Anforderungen
- 8+ years of experience
- MSc or PhD in CS, EE, CSEE or equivalent experience
- Strong background in Deep Learning
- Strong programming skills in Python and PyTorch
- Experience with inference optimization techniques
- Experience with inference optimization frameworks: TensorRT, TensorRT-LLM, vLLM, SGLang
- Familiarity with deploying Deep Learning models in production settings
- CUDA programming experience
- Familiarity with diffusion models
- Proven experience in analyzing, modeling, and tuning GPU workloads
Aufgaben
- Improve inference speed for Cosmos WFMs on GPU platforms
- Deploy Cosmos WFMs into production
- Profile and analyze deep learning workloads to identify bottlenecks
- Remove bottlenecks in deep learning processes
Berufserfahrung
- 8 Jahre
Ausbildung
- Master-Abschluss
Sprachen
- Englisch – verhandlungssicher
Tools & Technologien
- Python
- PyTorch
- TensorRT
- TensorRT-LLM
- vLLM
- SGLang
- Docker
- Triton Inference Server
- CUDA
Gefällt dir diese Stelle?
BetaDein Career Agent findet täglich ähnliche Jobs für dich.
Über das Unternehmen
DE01 NVIDIA Germany
Branche
IT
Beschreibung
The company is at the forefront of breakthroughs in Artificial Intelligence, High-Performance Computing, and Visualization.
Noch nicht perfekt?
- Analog Devices, Inc.
Staff AI Engineer(m/w/x)
Vollzeitmit HomeofficeSeniorMünchen - NVIDIA Germany
Senior Solutions Architect, AI Factory(m/w/x)
Vollzeitmit HomeofficeSeniorMünchen - Axelera AI
Senior/Staff Applications Engineer - Embedded AI(m/w/x)
VollzeitRemoteBerufserfahrenZürich, München - OMMAX
(Senior) AI Engineer(m/w/x)
Vollzeitmit HomeofficeBerufserfahrenMünchen - OMMAX
(Senior) MLOps Engineer(m/w/x)
Vollzeitmit HomeofficeBerufserfahrenMünchen