Senior HPC and AI Network Software Architect (m/f/x)
Build scalable software for distributed AI training and inference on NVIDIA GPUs, and evolve the architecture for throughput, latency, and memory efficiency. Requires a Ph.D. or equivalent industry experience and 5+ years of systems programming experience. A comprehensive benefits package is offered.
Requirements
- Ph.D. or equivalent industry experience in computer science/engineering
- 5+ years of experience in systems programming, parallel/distributed computing, high-performance networking, or large-scale data movement
- Designing and building complex systems
- Strong programming background in C++
- Strong programming background in Python
- Ideally, experience with CUDA or other GPU programming models
- Building production-quality performance-critical software
- Extensive hands-on experience with AI frameworks
- Solid grasp of communication libraries and runtime systems
- Developing and refining high-throughput, low-latency systems
- Reasoning across software stacks, hardware capabilities, and system bottlenecks
- Strong collaboration skills in a multinational, interdisciplinary setting
- Contributing ideas and building momentum
- Working effectively with senior engineers, researchers, and partner teams
- Deep expertise with NCCL, UCX, UCC, or similar communication libraries
- Strong background in networking and communication protocols
- Strong background in RDMA
- Strong background in collective communications
- Strong background in congestion-aware transport
- Strong background in accelerator-aware networking
- Comprehensive knowledge of large model training and inference serving
- Understanding communication bottlenecks, scheduling challenges, and system tradeoffs
- Experience with hardware-software co-design for distributed AI systems
- Contributions advancing GPU, DPU, interconnect, or runtime capabilities
- Familiarity with infrastructure for deployment of LLMs or transformer-based models
- Familiarity with sharding, pipelining, expert parallelism, or hybrid parallelism
Tasks
- Build scalable software systems for distributed AI training and inference
- Evolve architecture focusing on throughput, latency, resiliency, and memory efficiency
- Develop next-generation communication and runtime capabilities in libraries
- Evaluate libraries like NCCL, UCX, and UCC for frontier AI workloads
- Partner with AI framework teams to build integrations and improve performance
- Collaborate with internal platform teams to explore new approaches
- Work on hardware and system-level features to speed up data movement
- Enable new capabilities for training, inference, and model serving
- Drive innovation in runtime systems and communication libraries
- Help turn new ideas into practical and robust implementations
Work Experience
- 5 years
Education
- Doctoral / PhD
Languages
- English – Business Fluent
Tools & Technologies
- C++
- Python
- CUDA
- PyTorch
- TensorFlow
- JAX
- NCCL
- UCX
- UCC
- RDMA
Benefits
Other Benefits
- Comprehensive benefits package
About the Company
NVIDIA Switzerland AG
Industry
IT
Description
NVIDIA is a leading technology company specializing in Deep Learning, Artificial Intelligence, and Supercomputing.