Senior System Software Engineer – Embedded AI Inference (m/f/x)
Designing and deploying C++ agentic AI inference solutions with PyTorch integration for real-time GPU deployment. High-performance safety-critical software experience required. Hybrid work, 4-day work week.
Requirements
- 8+ years professional software engineering experience
- Experience in high-performance safety-critical software
- Experience in automotive, robotics, or real-time systems
- Master's or PhD in Computer Science or Machine Learning
- Strong modern C++ (C++14/17 or later)
- C++ templates, RAII, smart pointers, STL
- Experience building large C++ codebases
- Solid Python skills for tooling
- Solid Python skills for training scripts
- Solid Python skills for glue code
- Hands-on experience building agentic AI frameworks
- Experience with LLM / VLM inference
- Experience with LLM and VLM inference optimization
- Experience with speculative decoding, LoRA, MoE
- Experience developing on Linux
- Experience with Linux build systems (CMake)
- Experience with Linux debugging (gdb, sanitizers)
- Experience with Linux profiling
- Experience with git-based workflows in CI/CD
- Familiarity with GPU programming and optimization
- Familiarity with TensorRT
- Experience with agentic AI
- Experience with agents based on edge-friendly models (2–7B)
- Experience with agent context management
- Experience with reliable agent tool calling
- Experience with MCP (Model Context Protocol) for agents
- Experience with agentic coding
- Direct experience with NVIDIA DRIVE AGX platform
- Knowledge of AI model optimization
- Knowledge of AI model deployment
- Knowledge of quantization (INT8, FP8, 4-bit)
- Familiarity with high-performance LLM inference frameworks
- Familiarity with TensorRT-LLM or ONNX Runtime
- Understanding of software quality practices for safety-critical systems
- Knowledge of code review, unit testing, static analysis
- Automotive standards knowledge is a plus
- Open-source contributions in AI, robotics, or GPU computing
- Published work in AI, robotics, or GPU computing
Tasks
- Design C++ agentic AI and AI inference solutions
- Implement C++ agentic AI and AI inference solutions
- Maintain C++ agentic AI and AI inference solutions
- Integrate PyTorch deep learning models into C++ pipelines
- Deploy AI models for real-time inference on NVIDIA GPUs
- Build testable, modular libraries and components
- Extend testable, modular libraries and components
- Develop interfaces to models
- Develop interfaces to sensor drivers
- Develop interfaces to vehicle control
- Profile C++ and CUDA code
- Debug C++ and CUDA code
- Optimize C++ and CUDA code for latency and throughput
- Collaborate with ML researchers
- Collaborate with systems engineers
- Collaborate with automotive partners
- Turn prototype algorithms into production-ready implementations
- Solve technical problems in deep learning
- Solve technical problems in real-time systems
- Solve technical problems in production software engineering
Work Experience
- 8+ years
Education
- Master's degree
Languages
- English – Business Fluent
Tools & Technologies
- C++
- Python
- Linux
- CMake
- gdb
- git
- CI/CD
- TensorRT
- NVIDIA DRIVE AGX
- INT8
- FP8
- 4-bit
- TensorRT-LLM
- ONNX Runtime
About the Company
DE01 NVIDIA Germany
Industry
IT
Description
The company is at the forefront of breakthroughs in Artificial Intelligence, High-Performance Computing, and Visualization.