The AI Job Search Engine
Deep Learning Solutions Architect – Inference Optimization(m/w/x)
Description
In this role, you will engage with key customers to deliver tailored AI solutions while optimizing performance on advanced GPU systems. Your work will involve collaboration across teams and gathering insights to drive product development.
Let AI find the perfect jobs for you!
Upload your CV and Nejo AI will find matching job offers for you.
Requirements
- •MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, or other Engineering fields
- •5+ years work or research experience with Python, C++, or other software development
- •Work experience and knowledge of modern NLP including understanding of transformer, state space, diffusion, MOE model architectures
- •Understanding of key libraries used for NLP/LLM training and/or deployment
- •Proficient with DevOps tools including Docker, Kubernetes, and Singularity
- •Demonstrated experience in running and debugging large-scale distributed deep learning training or inference processes
- •Experience working with larger transformer-based architectures for NLP, CV, ASR, or other
- •Applied NLP technology in production environments
- •Enthusiasm for collaborating with various teams and departments
- •Self-starter with demeanor for growth and passion for continuous learning
Education
Work Experience
5 years
Tasks
- •Work directly with key customers to understand their technology
- •Provide optimal AI solutions for customer needs
- •Analyze and optimize performance on GPU architecture systems
- •Support optimization of large-scale inference pipelines
- •Collaborate with Engineering, Product, and Sales teams
- •Develop and plan suitable solutions based on customer requirements
- •Gather customer feedback to enhance product features
- •Conduct proof-of-concept evaluations
Tools & Technologies
Languages
English – Business Fluent
- NVIDIAFull-timeOn-siteExperiencedZürich
- NVIDIA Switzerland AG
Principal Software Architect, GPU Networking Research(m/w/x)
Full-timeOn-siteSeniorZürich - Red Hat (Switzerland) SARL
Senior Machine Learning Engineer - Red Hat Inference(m/w/x)
Full-timeOn-siteSeniorZürich - NVIDIA
Senior Software Developer(m/w/x)
Full-timeOn-siteSeniorZürich - RepRisk AG
Senior AI Engineer(m/w/x)
Full-timeOn-siteSeniorZürich
Deep Learning Solutions Architect – Inference Optimization(m/w/x)
The AI Job Search Engine
Description
In this role, you will engage with key customers to deliver tailored AI solutions while optimizing performance on advanced GPU systems. Your work will involve collaboration across teams and gathering insights to drive product development.
Let AI find the perfect jobs for you!
Upload your CV and Nejo AI will find matching job offers for you.
Requirements
- •MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, or other Engineering fields
- •5+ years work or research experience with Python, C++, or other software development
- •Work experience and knowledge of modern NLP including understanding of transformer, state space, diffusion, MOE model architectures
- •Understanding of key libraries used for NLP/LLM training and/or deployment
- •Proficient with DevOps tools including Docker, Kubernetes, and Singularity
- •Demonstrated experience in running and debugging large-scale distributed deep learning training or inference processes
- •Experience working with larger transformer-based architectures for NLP, CV, ASR, or other
- •Applied NLP technology in production environments
- •Enthusiasm for collaborating with various teams and departments
- •Self-starter with demeanor for growth and passion for continuous learning
Education
Work Experience
5 years
Tasks
- •Work directly with key customers to understand their technology
- •Provide optimal AI solutions for customer needs
- •Analyze and optimize performance on GPU architecture systems
- •Support optimization of large-scale inference pipelines
- •Collaborate with Engineering, Product, and Sales teams
- •Develop and plan suitable solutions based on customer requirements
- •Gather customer feedback to enhance product features
- •Conduct proof-of-concept evaluations
Tools & Technologies
Languages
English – Business Fluent
About the Company
NVIDIA
Industry
IT
Description
The company is developing groundbreaking solutions in Virtual Reality, Artificial Intelligence, Deep Learning, and Autonomous Vehicles.
- NVIDIA
HPC and AI Software Architect(m/w/x)
Full-timeOn-siteExperiencedZürich - NVIDIA Switzerland AG
Principal Software Architect, GPU Networking Research(m/w/x)
Full-timeOn-siteSeniorZürich - Red Hat (Switzerland) SARL
Senior Machine Learning Engineer - Red Hat Inference(m/w/x)
Full-timeOn-siteSeniorZürich - NVIDIA
Senior Software Developer(m/w/x)
Full-timeOn-siteSeniorZürich - RepRisk AG
Senior AI Engineer(m/w/x)
Full-timeOn-siteSeniorZürich