Skip to content
Neuer Job?Nejo!

Dein persönlicher KI-Karriere-Agent

NVNVIDIA Switzerland AG

Solutions Architect, Cloud Inference Services(m/w/x)

Zürich
VollzeitVor OrtSenior
AI/ML
Data Science

Deploying E2E AI solutions for NVIDIA Cloud Partners, focusing on LLMs and Agentic Pipelines. Master's or Ph.D. in CS/AI or equivalent experience required. Support for AI services from training to inference.

Anforderungen

  • Excellent verbal and written communication skills
  • Excellent technical presentation skills in English
  • Master's or Ph.D. in Computer Science or AI
  • Equivalent experience in Computer Science or AI
  • 5+ years industry/academic experience in ML/DL/Data Science
  • Preference for DNN inference experience
  • Work experience with modern LLM, VLM, diffusion architectures
  • Emphasis on MoE architectures
  • Understanding of DNN inference libraries
  • Understanding of agentic pipeline development
  • Excited to work with multiple levels and teams
  • Collaboration with Engineering, Product, Sales, Marketing
  • Strong analytical skills
  • Strong problem-solving skills
  • Self-starter drive for growth
  • Passion for continuous learning
  • Sharing findings across the team
  • Strong time-management skills
  • Strong organization skills
  • Coordinating multiple initiatives and priorities
  • Implementing new technology and products
  • Experience with inference of large MoE architectures
  • Experience with NLP, CV, or ASR inference
  • Experience using DevOps technologies
  • Understanding of HPC systems
  • Understanding of data center design
  • Understanding of high speed interconnect InfiniBand
  • Understanding of Cluster Storage
  • Understanding of Scheduling related design/management

Aufgaben

  • Help NVIDIA Cloud Partners integrate AI stacks
  • Develop and deploy E2E AI solutions
  • Support AI services from training to inference
  • Participate in projects involving LLMs, VLMs, Physical-AI, Agentic Pipelines
  • Coordinate between customers, marketing, business development, and engineering
  • Work on proof-of-concept demonstrations
  • Lead discussions with developers, product teams, and executives
  • Encourage adoption of NVIDIA’s AI technology platform
  • Simplify deployment of AI technology to production
  • Engage with different roles within NVIDIA and partners
  • Understand NCPs' technology and provide solutions
  • Develop and demonstrate NLP and LLM solutions
  • Integrate solutions into agentic pipelines
  • Perform in-depth GPU system analysis and optimization
  • Optimize end-to-end agentic pipelines
  • Partner with Engineering, Product, and Sales teams
  • Develop and plan suitable solutions for customers
  • Enable product feature growth through customer feedback
  • Build industry expertise in AI Cloud solutions
  • Contribute to integrating NVIDIA technology in Enterprise Computing

Berufserfahrung

  • 5 Jahre

Ausbildung

  • Master-Abschluss

Sprachen

  • Englischfließend

Tools & Technologien

  • Machine learning
  • Deep learning
  • Data science
  • DNN inference
  • LLM
  • VLM
  • Diffusion architectures
  • MoE
  • TRT-LLM
  • Dynamo
  • RedHat Inference Server
  • DevOps
  • Docker
  • Kubernetes
  • Singularity
  • HPC systems
  • InfiniBand
  • Cluster Storage
Die Originalanzeige dieses Stellenangebotes in der aktuellsten Version findest du hier. Nejo hat diesen Job automatisch von der Website des Unternehmens NVIDIA Switzerland AG erfasst und die Informationen auf Nejo mit Hilfe von KI für dich aufbereitet. Trotz sorgfältiger Analyse können einzelne Informationen unvollständig oder ungenau sein. Bitte prüfe immer alle Angaben in der Originalanzeige! Inhalte und Urheberrechte der Originalanzeige liegen beim ausschreibenden Unternehmen.

Gefällt dir diese Stelle?

Beta

Dein Career Agent findet täglich ähnliche Jobs für dich.


  • NVIDIA

    Deep Learning Solutions Architect – Inference Optimization(m/w/x)

    Vollzeitnur vor OrtSenior
    Zürich
  • NVIDIA Switzerland AG

    Senior HPC and AI Network Software Architect(m/w/x)

    Vollzeitnur vor OrtSenior
    Zürich
  • NVIDIA

    HPC and AI Software Architect(m/w/x)

    Vollzeitnur vor OrtBerufserfahren
    Zürich
  • NVIDIA Switzerland AG

    Deep Learning Engineer, LLM Accuracy Evaluation(m/w/x)

    Vollzeitnur vor OrtSenior
    Zürich
  • NVIDIA

    Senior GPU Networking Architect(m/w/x)

    Vollzeitnur vor OrtSenior
    Zürich
Alle 100+ ähnlichen Jobs ansehen

Diese Jobs könnten dich auch interessieren