New Job?Nejo!

Your personal AI career agent

TETether Operations Limited

5h ago

.AI Research Engineer (Model Compression & Quantization)(m/w/x)

Zürich

Full-timeRemoteSenior

AI/ML

Nejo AI Summary

Apply now

Applying low-bit quantization to generative AI models for financial token integration. PhD in NLP/ML and A* publications required. Focus on model compression and efficient deployment.

Requirements

Degree in Computer Science or related field
PhD in NLP, Machine Learning, or related field
Solid track record in AI R&D with A* publications
Experience with PyTorch or equivalent frameworks
Hands-on experience with model quantization (QAT and PTQ)
Research and hands-on experience with knowledge distillation
Research and hands-on experience with model pruning
Solid understanding of neural network architectures and training
Understanding of transformers (LLMs, VLMs), backpropagation, optimization, fine-tuning
Familiarity with C++ (advantageous)

Tasks

Drive innovation in model compression and efficient deployment
Reduce model footprint and computational cost
Apply low-bit quantization to generative AI models
Maintain accuracy and output quality during quantization
Leverage knowledge distillation for efficient multimodal reasoning
Implement pruning techniques to reduce computational overhead
Analyze trade-offs between model efficiency and accuracy
Propose improvements based on empirical findings
Research mixed-precision quantization and advanced compression strategies
Stay current with the latest research in model compression
Document methodologies, experiments, and results clearly
Support reproducibility and internal collaboration
Communicate results to stakeholders
Author technical papers and publish findings
Advance the field of model compression for multimodal AI

Work Experience

approx. 4 - 6 years

Education

Bachelor's degree

Languages

English – Business Fluent

Tools & Technologies

PyTorch
model quantization
Quantization-Aware Training (QAT)
Post-Training Quantization (PTQ)
knowledge distillation
model pruning
neural network architectures
transformers
LLMs
VLMs
backpropagation
optimization
fine-tuning
C++

Find the original job posting in its most current version here. Nejo automatically captured this job from the website of Tether Operations Limited and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.

Like this job?

Beta

Your Career Agent finds similar jobs for you every day.

Not a perfect match?

100+ Similar Jobs in Zürich View all

Tether Operations Limited
AI Research Engineer - Kernel & Inference Optimization(m/w/x)
Full-timeRemoteSenior
Zürich
ANYbotics
Senior AI Research Engineer, Visual Perception(m/w/x)
Full-timeWith HomeofficeSenior
Zürich
Anthropic
Research Engineer / Research Scientist, Pre-training(m/w/x)
Full-timeWith HomeofficeExperienced
Zürich
from CHF 280,000 - 680,000 / year
Avaloq
.AI Software Engineer(m/w/x)
Full-timeWith HomeofficeSenior
Zürich
Mistral
AI Scientist(m/w/x)
Full-timeWith HomeofficeNot specified
Zürich

View all 100+ similar jobs

TETether Operations Limited

5h ago