DE01 NVIDIA Germany

Senior System Software Engineer – Embedded AI Inference (m/f/x)

München
Full-time, On-site, Senior
AI/ML
Data Science

Designing and deploying C++ agentic AI inference solutions with PyTorch integration for real-time GPU deployment. High-performance safety-critical software experience required. Hybrid work, 4-day work week.

Requirements

  • 8+ years professional software engineering experience
  • Experience in high-performance safety-critical software
  • Experience in automotive, robotics, or real-time systems
  • Master's or PhD in Computer Science or Machine Learning
  • Strong modern C++ (C++14/17 or later)
  • C++ templates, RAII, smart pointers, STL
  • Experience building large C++ codebases
  • Solid Python skills for tooling, training scripts, and glue code
  • Hands-on experience building agentic AI frameworks
  • Experience with LLM and VLM inference, including inference optimization
  • Experience with speculative decoding, LoRA, MoE
  • Experience developing on Linux
  • Experience with Linux build systems (CMake)
  • Experience with Linux debugging (gdb, sanitizers)
  • Experience with Linux profiling
  • Experience with git-based workflows in CI/CD
  • Familiarity with GPU programming and optimization
  • Familiarity with TensorRT
  • Experience with agentic AI
  • Experience with agents based on edge-friendly models (2–7B)
  • Experience with agent context management
  • Experience with reliable agent tool calling
  • Experience with MCP (Model Context Protocol) for agents
  • Experience with agentic coding
  • Direct experience with NVIDIA DRIVE AGX platform
  • Knowledge of AI model optimization
  • Knowledge of AI model deployment
  • Knowledge of quantization (INT8, FP8, 4-bit)
  • Familiarity with high-performance LLM inference frameworks
  • Familiarity with TensorRT-LLM or ONNX Runtime
  • Understanding of software quality practices for safety-critical systems
  • Knowledge of code review, unit testing, static analysis
  • Automotive standards knowledge is a plus
  • Open-source contributions in AI, robotics, or GPU computing
  • Published work in AI, robotics, or GPU computing

Tasks

  • Design, implement, and maintain C++ agentic AI and AI inference solutions
  • Integrate PyTorch Deep Learning models into C++ pipelines
  • Deploy AI models for real-time inference on NVIDIA GPUs
  • Build and extend testable, modular libraries and components
  • Develop interfaces to models, sensor drivers, and vehicle control
  • Profile, debug, and optimize C++ and CUDA code for latency and throughput
  • Collaborate with ML researchers, systems engineers, and automotive partners
  • Turn prototype algorithms into production-ready implementations
  • Solve technical problems in deep learning, real-time systems, and production software engineering

Work Experience

  • 8+ years

Education

  • Master's degree

Languages

  • English: Business Fluent

Tools & Technologies

  • C++
  • Python
  • Linux
  • CMake
  • gdb
  • git
  • CI/CD
  • TensorRT
  • NVIDIA DRIVE AGX
  • INT8
  • FP8
  • 4-bit
  • TensorRT-LLM
  • ONNX Runtime

The original job posting, in its most current version, is on the website of DE01 NVIDIA Germany. Nejo automatically captured this job from that website and processed the information with the help of AI. Despite careful analysis, some information may be incomplete or inaccurate, so always verify all details in the original posting. Content and copyrights of the original posting belong to the advertising company.