Dein persönlicher KI-Karriere-Agent
Senior HPC and AI Network Software Architect(m/w/x)
Building scalable software for distributed AI training and inference on NVIDIA GPUs. Evolving architecture for throughput, latency, and memory efficiency. Ph.D. or 5+ years systems programming experience required. Comprehensive benefits package.
Anforderungen
- Ph.D. or equivalent industry experience in computer science/engineering
- 5+ years systems programming, parallel/distributed computing, high-performance networking, or large-scale data movement
- Designing and building complex systems
- Strong programming background in C++
- Strong programming background in Python
- Ideally CUDA or other GPU programming models
- Building production-quality performance-critical software
- Extensive hands-on experience with AI frameworks
- Solid grasp of communication libraries and runtime systems
- Developing and refining high-throughput, low-latency systems
- Reasoning across software stacks, hardware capabilities, and system bottlenecks
- Strong collaboration skills in multi-national, interdisciplinary setting
- Contributing ideas and building momentum
- Working effectively with senior engineers, researchers, and partner teams
- Deep expertise with NCCL, UCX, UCC, or similar communication libraries
- Strong background in networking and communication protocols
- Strong background in RDMA
- Strong background in collective communications
- Strong background in congestion-aware transport
- Strong background in accelerator-aware networking
- Comprehensive knowledge of large model training and inference serving
- Understanding communication bottlenecks, scheduling challenges, and system tradeoffs
- Experience crafting hardware-software co-design for distributed AI systems
- Contributions advancing GPU, DPU, interconnect, or runtime capabilities
- Familiarity with infrastructure for deployment of LLMs or transformer-based models
- Familiarity with sharding, pipelining, expert parallelism, or hybrid parallelism
Aufgaben
- Build scalable software systems for distributed AI training and inference
- Evolve architecture focusing on throughput, latency, resiliency, and memory efficiency
- Develop next-generation communication and runtime capabilities in libraries
- Evaluate libraries like NCCL, UCX, and UCC for frontier AI workloads
- Partner with AI framework teams to build integrations and improve performance
- Collaborate with internal platform teams to explore new approaches
- Work on hardware and system-level features to speed up data movement
- Enable new capabilities for training, inference, and model serving
- Drive innovation in runtime systems and communication libraries
- Help turn new ideas into practical and robust implementations
Berufserfahrung
- 5 Jahre
Ausbildung
- Doktor / Ph.D.
Sprachen
- Englisch – verhandlungssicher
Tools & Technologien
- C++
- Python
- CUDA
- PyTorch
- TensorFlow
- JAX
- NCCL
- UCX
- UCC
- RDMA
Benefits
Sonstige Vorteile
- Comprehensive benefits package
Noch nicht perfekt?
- NVIDIAVollzeitnur vor OrtBerufserfahrenZürich
- NVIDIA
Senior GPU Networking Architect(m/w/x)
Vollzeitnur vor OrtSeniorZürich - NVIDIA Switzerland AG
Principal Software Architect, GPU Networking Research(m/w/x)
Vollzeitnur vor OrtSeniorZürich - NVIDIA Switzerland AG
HPC and AI Software Architecture Intern(m/w/x)
VollzeitPraktikumnur vor OrtZürich - NVIDIA
Senior Software Developer(m/w/x)
Vollzeitnur vor OrtSeniorZürich
Senior HPC and AI Network Software Architect(m/w/x)
Building scalable software for distributed AI training and inference on NVIDIA GPUs. Evolving architecture for throughput, latency, and memory efficiency. Ph.D. or 5+ years systems programming experience required. Comprehensive benefits package.
Anforderungen
- Ph.D. or equivalent industry experience in computer science/engineering
- 5+ years systems programming, parallel/distributed computing, high-performance networking, or large-scale data movement
- Designing and building complex systems
- Strong programming background in C++
- Strong programming background in Python
- Ideally CUDA or other GPU programming models
- Building production-quality performance-critical software
- Extensive hands-on experience with AI frameworks
- Solid grasp of communication libraries and runtime systems
- Developing and refining high-throughput, low-latency systems
- Reasoning across software stacks, hardware capabilities, and system bottlenecks
- Strong collaboration skills in multi-national, interdisciplinary setting
- Contributing ideas and building momentum
- Working effectively with senior engineers, researchers, and partner teams
- Deep expertise with NCCL, UCX, UCC, or similar communication libraries
- Strong background in networking and communication protocols
- Strong background in RDMA
- Strong background in collective communications
- Strong background in congestion-aware transport
- Strong background in accelerator-aware networking
- Comprehensive knowledge of large model training and inference serving
- Understanding communication bottlenecks, scheduling challenges, and system tradeoffs
- Experience crafting hardware-software co-design for distributed AI systems
- Contributions advancing GPU, DPU, interconnect, or runtime capabilities
- Familiarity with infrastructure for deployment of LLMs or transformer-based models
- Familiarity with sharding, pipelining, expert parallelism, or hybrid parallelism
Aufgaben
- Build scalable software systems for distributed AI training and inference
- Evolve architecture focusing on throughput, latency, resiliency, and memory efficiency
- Develop next-generation communication and runtime capabilities in libraries
- Evaluate libraries like NCCL, UCX, and UCC for frontier AI workloads
- Partner with AI framework teams to build integrations and improve performance
- Collaborate with internal platform teams to explore new approaches
- Work on hardware and system-level features to speed up data movement
- Enable new capabilities for training, inference, and model serving
- Drive innovation in runtime systems and communication libraries
- Help turn new ideas into practical and robust implementations
Berufserfahrung
- 5 Jahre
Ausbildung
- Doktor / Ph.D.
Sprachen
- Englisch – verhandlungssicher
Tools & Technologien
- C++
- Python
- CUDA
- PyTorch
- TensorFlow
- JAX
- NCCL
- UCX
- UCC
- RDMA
Benefits
Sonstige Vorteile
- Comprehensive benefits package
Über das Unternehmen
NVIDIA Switzerland AG
Branche
IT
Beschreibung
NVIDIA is a leading technology company specializing in Deep Learning, Artificial Intelligence, and Supercomputing.
Noch nicht perfekt?
- NVIDIA
HPC and AI Software Architect(m/w/x)
Vollzeitnur vor OrtBerufserfahrenZürich - NVIDIA
Senior GPU Networking Architect(m/w/x)
Vollzeitnur vor OrtSeniorZürich - NVIDIA Switzerland AG
Principal Software Architect, GPU Networking Research(m/w/x)
Vollzeitnur vor OrtSeniorZürich - NVIDIA Switzerland AG
HPC and AI Software Architecture Intern(m/w/x)
VollzeitPraktikumnur vor OrtZürich - NVIDIA
Senior Software Developer(m/w/x)
Vollzeitnur vor OrtSeniorZürich