Your personal AI career agent
Senior Deep Learning Engineer(m/w/x)
Improving inference speed and deploying Cosmos WFMs on GPU platforms for AI/HPC breakthroughs. MSc/PhD in CS/EE or equivalent, with Python/PyTorch expertise, essential. Work on cutting-edge AI/HPC projects.
Requirements
- 8+ years of experience
- MSc or PhD in CS, EE, CSEE or equivalent experience
- Strong background in Deep Learning
- Strong programming skills in Python and PyTorch
- Experience with inference optimization techniques
- Experience with inference optimization frameworks: TensorRT, TensorRT-LLM, vLLM, SGLang
- Familiarity with deploying Deep Learning models in production settings
- CUDA programming experience
- Familiarity with diffusion models
- Proven experience in analyzing, modeling, and tuning GPU workloads
Tasks
- Improve inference speed for Cosmos WFMs on GPU platforms
- Deploy Cosmos WFMs into production
- Profile and analyze deep learning workloads to identify bottlenecks
- Remove bottlenecks in deep learning processes
Work Experience
- 8 years
Education
- Master's degree
Languages
- English – Business Fluent
Tools & Technologies
- Python
- PyTorch
- TensorRT
- TensorRT-LLM
- vLLM
- SGLang
- Docker
- Triton Inference Server
- CUDA
Like this job?
BetaYour Career Agent finds similar jobs for you every day.
Not a perfect match?
- Analog Devices, Inc.Full-timeWith HomeofficeSeniorMünchen
- NVIDIA Germany
Senior Solutions Architect, AI Factory(m/w/x)
Full-timeWith HomeofficeSeniorMünchen - OMMAX
(Senior) AI Engineer(m/w/x)
Full-timeWith HomeofficeExperiencedMünchen - OMMAX
(Senior) MLOps Engineer(m/w/x)
Full-timeWith HomeofficeExperiencedMünchen - Awin
Senior AI Engineer(m/w/x)
Full-timeWith HomeofficeSeniorBerlin, Hannover, München
Senior Deep Learning Engineer(m/w/x)
Improving inference speed and deploying Cosmos WFMs on GPU platforms for AI/HPC breakthroughs. MSc/PhD in CS/EE or equivalent, with Python/PyTorch expertise, essential. Work on cutting-edge AI/HPC projects.
Requirements
- 8+ years of experience
- MSc or PhD in CS, EE, CSEE or equivalent experience
- Strong background in Deep Learning
- Strong programming skills in Python and PyTorch
- Experience with inference optimization techniques
- Experience with inference optimization frameworks: TensorRT, TensorRT-LLM, vLLM, SGLang
- Familiarity with deploying Deep Learning models in production settings
- CUDA programming experience
- Familiarity with diffusion models
- Proven experience in analyzing, modeling, and tuning GPU workloads
Tasks
- Improve inference speed for Cosmos WFMs on GPU platforms
- Deploy Cosmos WFMs into production
- Profile and analyze deep learning workloads to identify bottlenecks
- Remove bottlenecks in deep learning processes
Work Experience
- 8 years
Education
- Master's degree
Languages
- English – Business Fluent
Tools & Technologies
- Python
- PyTorch
- TensorRT
- TensorRT-LLM
- vLLM
- SGLang
- Docker
- Triton Inference Server
- CUDA
Like this job?
BetaYour Career Agent finds similar jobs for you every day.
About the Company
DE01 NVIDIA Germany
Industry
IT
Description
The company is at the forefront of breakthroughs in Artificial Intelligence, High-Performance Computing, and Visualization.
Not a perfect match?
- Analog Devices, Inc.
Staff AI Engineer(m/w/x)
Full-timeWith HomeofficeSeniorMünchen - NVIDIA Germany
Senior Solutions Architect, AI Factory(m/w/x)
Full-timeWith HomeofficeSeniorMünchen - OMMAX
(Senior) AI Engineer(m/w/x)
Full-timeWith HomeofficeExperiencedMünchen - OMMAX
(Senior) MLOps Engineer(m/w/x)
Full-timeWith HomeofficeExperiencedMünchen - Awin
Senior AI Engineer(m/w/x)
Full-timeWith HomeofficeSeniorBerlin, Hannover, München