Developer Technology Engineer, Energy (m/f/x)
Optimize GPU-accelerated applications by analyzing CUDA kernels and memory movement. Strong C/C++ and Python skills on Linux are required. You will work with NVIDIA Nsight Systems and Nsight Compute for performance profiling.
Requirements
- BS/MS or equivalent experience in CS/CE/EE/Physics/Applied Math
- Strong C/C++ and Python programming skills on Linux
- CUDA programming and GPU optimization concepts
- Profiling and debugging performance with NVIDIA Nsight Systems/Compute
- Understanding of parallel computing and performance fundamentals
- Clear communication of technical findings
- 5+ years GPU/HPC optimization experience
- HPC experience with MPI, distributed systems, and multi-node tuning
- Energy/HPC domain exposure
- Seismic processing pipelines, RTM/FWI patterns, FFT/stencil/linear algebra codes
- Reservoir simulation, preconditioning, domain decomposition
- Power grid simulation/optimization workflows
- CI/perf regression testing, containerized workflows, schedulers
- Familiarity with AI workflows alongside simulation
Responsibilities
- Profile and optimize GPU-accelerated applications
- Analyze CUDA kernels, memory movement, and concurrency
- Enhance end-to-end throughput of GPU applications
- Optimize CUDA C++ kernel performance
- Configure CUDA launch settings
- Optimize memory hierarchy and streams/events
- Leverage GPU libraries (cuBLAS, cuFFT, cuSPARSE, cuSOLVER, NCCL)
- Scale multi-GPU and multi-node systems using MPI + NCCL
- Overlap CPU and GPU operations
- Design efficient communication patterns
- Create reproducible benchmarks and performance reports
- Generate tuning recommendations
- Develop reference implementations and examples
- Patch customer code for performance and portability
- Support customer engagements from POCs to production
- Debug correctness and performance issues
- Advise on best practices for deployment
- Collaborate with internal teams to file actionable issues
- Validate fixes and influence product roadmap
- Build internal libraries and reusable code
Work experience
- 5 years
Education
- Master's degree
Languages
- English (business fluent)
Tools & Technologies
- C++
- Python
- Linux
- CUDA
- NVIDIA Nsight Systems
- Nsight Compute
- MPI
- Docker
- Apptainer
- Slurm
About the company
NVIDIA
Industry
IT
Description
The company is developing groundbreaking solutions in Virtual Reality, Artificial Intelligence, Deep Learning, and Autonomous Vehicles.