The AI Job Search Engine
HPC and AI Software Architect (m/w/x)
Optimize distributed AI training and inference systems, enhance communication libraries (NCCL, UCX, UCC), and co-design hardware features that accelerate data movement. A Ph.D. or equivalent industry experience in computer science, plus 2+ years in high-performance data movement or distributed computing, is required. The role has direct impact on groundbreaking AI, VR, and autonomous-vehicle solutions.
Requirements
- Ph.D. or equivalent industry experience in computer science, computer engineering, or a closely related field
- 2+ years of experience in systems programming, parallel or distributed computing, or high-performance data movement
- Strong programming background in C++, Python, and ideally CUDA or other GPU programming models
- Practical experience with AI frameworks (e.g., PyTorch, TensorFlow) and familiarity with communication libraries
- Experience in designing or optimizing software for high-throughput, low-latency systems
- Strong collaboration skills in a multi-national, interdisciplinary environment
- Expertise with NCCL, Gloo, UCX, or similar libraries used in distributed AI workloads
- Background in networking and communication protocols, RDMA, collective communications, or accelerator-aware networking
- Deep understanding of large model training, inference serving at scale, and associated communication bottlenecks
- Knowledge of quantization, tensor/activation fusion, or memory optimization for inference
- Familiarity with infrastructure for deployment of LLMs or transformer-based models, including sharding, pipelining, or hybrid parallelism
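The sharding and parallelism infrastructure mentioned in the last requirement can be pictured with a minimal sketch. This is a toy, single-process simulation in plain Python, not any framework's API: each "rank" holds a slice of a weight matrix's rows (the simplest form of tensor parallelism), computes its slice of `y = W @ x`, and the slices are concatenated afterwards, which a real system would do with an all-gather. The helper names are illustrative.

```python
# Toy row-wise weight sharding: each simulated "rank" stores a slice of
# the weight matrix's rows, computes its slice of y = W @ x with the full
# input x, and the slices are concatenated (an all-gather in a real system).

def shard_rows(W, world_size):
    """Split the rows of W into `world_size` near-equal contiguous shards."""
    base, rem = divmod(len(W), world_size)
    shards, start = [], 0
    for rank in range(world_size):
        height = base + (1 if rank < rem else 0)
        shards.append(W[start:start + height])
        start += height
    return shards

def matvec(W, x):
    """Plain dense matrix-vector product."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

# Reference: single-device computation.
W = [[1, 2, 3],
     [4, 5, 6],
     [7, 8, 9],
     [10, 11, 12]]
x = [1, 0, -1]
y_dense = matvec(W, x)

# Sharded: every rank gets the full x but only its rows of W.
y_sharded = []
for shard in shard_rows(W, world_size=2):
    y_sharded.extend(matvec(shard, x))   # concatenation stands in for all-gather

assert y_sharded == y_dense
print(y_sharded)  # [-2, -2, -2, -2]
```

Row-wise sharding needs only a gather at the end; splitting along the input dimension instead would require an all-reduce of partial sums, which is where the communication libraries named above come in.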
Tasks
- Design and prototype scalable software systems for distributed AI training and inference
- Optimize throughput, latency, and memory efficiency
- Develop and evaluate enhancements to communication libraries like NCCL, UCX, and UCC
- Collaborate with AI framework teams to improve communication backend integration and performance
- Co-design hardware features to accelerate data movement for inference and model serving
- Contribute to the evolution of runtime systems and AI-specific protocol layers
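The collective-communication work behind libraries like NCCL can be sketched in miniature. The ring all-reduce below simulates all ranks sequentially in one Python process, a toy of the pipelined GPU-to-GPU algorithm NCCL actually implements; the function name and structure are illustrative, not NCCL's API. Each rank's buffer is split into `world_size` chunks that travel around the ring twice: once accumulating partial sums (reduce-scatter), once distributing the finished sums (all-gather).

```python
def ring_allreduce(buffers):
    """Simulate a ring all-reduce (sum) over per-rank buffers.

    buffers: one equal-length list per rank. Each buffer is split into
    world_size chunks; after 2*(world_size - 1) ring steps, every rank
    holds the elementwise sum of all buffers. Mutates and returns buffers.
    """
    world = len(buffers)
    n = len(buffers[0])
    assert n % world == 0, "toy version: length must divide evenly"
    chunk = n // world

    def sl(i):
        """Slice covering chunk i of a buffer."""
        return slice(i * chunk, (i + 1) * chunk)

    # Reduce-scatter: at step s, rank r sends chunk (r - s) mod world to
    # its ring neighbor, which adds it in. Sends are snapshotted first to
    # mimic all ranks communicating simultaneously.
    for step in range(world - 1):
        sends = [(r, (r - step) % world) for r in range(world)]
        data = [buffers[r][sl(c)] for r, c in sends]
        for (r, c), payload in zip(sends, data):
            dst = (r + 1) % world
            for k, v in enumerate(payload):
                buffers[dst][c * chunk + k] += v

    # All-gather: fully reduced chunks circulate once more, overwriting.
    for step in range(world - 1):
        sends = [(r, (r + 1 - step) % world) for r in range(world)]
        data = [buffers[r][sl(c)] for r, c in sends]
        for (r, c), payload in zip(sends, data):
            buffers[(r + 1) % world][sl(c)] = payload

    return buffers

print(ring_allreduce([[1, 2], [3, 4]]))  # [[4, 6], [4, 6]]
```

The point of the ring schedule is bandwidth: each rank sends only `2 * (world - 1) / world` of its buffer in total, regardless of how many ranks participate, which is why variants of this algorithm underpin gradient all-reduce in data-parallel training.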
Work Experience
- 2 years
Education
- Doctoral / PhD
Languages
- English – Business Fluent
Tools & Technologies
- C++
- Python
- CUDA
- PyTorch
- TensorFlow
- NCCL
- Gloo
- UCX
Not a perfect match?
- NVIDIA Switzerland AG – HPC and AI Software Architecture Intern (m/w/x) – Full-time, Internship, On-site, Zürich
- NVIDIA Switzerland AG – Principal Software Architect, GPU Networking Research (m/w/x) – Full-time, On-site, Senior, Zürich
- NVIDIA – Deep Learning Solutions Architect – Inference Optimization (m/w/x) – Full-time, On-site, Senior, Zürich
- NVIDIA Switzerland AG – Research Scientist, ML Systems - PhD New College Grad (m/w/x) – Full-time, On-site, Experienced, Zürich
- Hewlett Packard Enterprise – Research Engineer HPC/AI Focus Daedalus System (m/w/x) – Full-time, On-site, Experienced, Basel, Zürich
About the Company
NVIDIA
Industry
IT
Description
The company is developing groundbreaking solutions in Virtual Reality, Artificial Intelligence, Deep Learning, and Autonomous Vehicles.