Neuer Job?Nejo!

Dein persönlicher KI-Karriere-Agent

NVNVIDIA

vor 5 Monaten

Deep Learning Solutions Architect – Inference Optimization(m/w/x)

Zürich

VollzeitVor OrtSenior

AI/ML

Data Science

Nejo KI-Zusammenfassung

Jetzt bewerben

Optimizing large-scale inference pipelines on GPU architectures for AI/VR/AV customer solutions. Modern NLP/LLM architecture knowledge (transformer, diffusion) and DevOps tools (Docker, Kubernetes) proficiency required. Direct customer engagement on groundbreaking AI/VR/AV solutions.

Anforderungen

MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, or other Engineering fields
5+ years work or research experience with Python, C++, or other software development
Work experience and knowledge of modern NLP including understanding of transformer, state space, diffusion, MOE model architectures
Understanding of key libraries used for NLP/LLM training and/or deployment
Proficient with DevOps tools including Docker, Kubernetes, and Singularity
Demonstrated experience in running and debugging large-scale distributed deep learning training or inference processes
Experience working with larger transformer-based architectures for NLP, CV, ASR, or other
Applied NLP technology in production environments
Enthusiasm for collaborating with various teams and departments
Self-starter with demeanor for growth and passion for continuous learning

Aufgaben

Work directly with key customers to understand their technology
Provide optimal AI solutions for customer needs
Analyze and optimize performance on GPU architecture systems
Support optimization of large-scale inference pipelines
Collaborate with Engineering, Product, and Sales teams
Develop and plan suitable solutions based on customer requirements
Gather customer feedback to enhance product features
Conduct proof-of-concept evaluations

Berufserfahrung

5 Jahre

Ausbildung

Master-Abschluss

Sprachen

Englisch – verhandlungssicher

Tools & Technologien

TRT LLM
vLLM
SGLang
Python
C++
Megatron-LM
NeMo
DeepSpeed
TensorRT-LLM
Triton Inference Server
Docker
Kubernetes
Singularity

Die Originalanzeige dieses Stellenangebotes in der aktuellsten Version findest du hier. Nejo hat diesen Job automatisch von der Website des Unternehmens NVIDIA erfasst und die Informationen auf Nejo mit Hilfe von KI für dich aufbereitet. Trotz sorgfältiger Analyse können einzelne Informationen unvollständig oder ungenau sein. Bitte prüfe immer alle Angaben in der Originalanzeige! Inhalte und Urheberrechte der Originalanzeige liegen beim ausschreibenden Unternehmen.

Gefällt dir diese Stelle?

Beta

Dein Career Agent findet täglich ähnliche Jobs für dich.

Noch nicht perfekt?

NVIDIA Switzerland AG
Solutions Architect, Cloud Inference Services(m/w/x)
Vollzeitnur vor OrtSenior
Zürich
NVIDIA Switzerland AG
Deep Learning Engineer, LLM Accuracy Evaluation(m/w/x)
Vollzeitnur vor OrtSenior
Zürich
NVIDIA
Senior GPU Networking Architect(m/w/x)
Vollzeitnur vor OrtSenior
Zürich
NVIDIA
HPC and AI Software Architect(m/w/x)
Vollzeitnur vor OrtBerufserfahren
Zürich
NVIDIA Switzerland AG
Senior HPC and AI Network Software Architect(m/w/x)
Vollzeitnur vor OrtSenior
Zürich

Alle 100+ ähnlichen Jobs ansehen

NVNVIDIA

vor 5 Monaten