Neuer Job?Nejo!

Dein persönlicher KI-Karriere-Agent

TETether Operations Limited

vor 18 Tagen

.AI Research Engineer (Model Compression & Quantization)(m/w/x)

Zürich

VollzeitRemoteSenior

AI/ML

Nejo KI-Zusammenfassung

Jetzt bewerben

Applying low-bit quantization to generative AI models for financial token integration. PhD in NLP/ML and A* publications required. Focus on model compression and efficient deployment.

Anforderungen

Degree in Computer Science or related field
PhD in NLP, Machine Learning, or related field
Solid track record in AI R&D with A* publications
Experience with PyTorch or equivalent frameworks
Hands-on experience with model quantization (QAT and PTQ)
Research and hands-on experience with knowledge distillation
Research and hands-on experience with model pruning
Solid understanding of neural network architectures and training
Understanding of transformers (LLMs, VLMs), backpropagation, optimization, fine-tuning
Familiarity with C++ (advantageous)

Aufgaben

Drive innovation in model compression and efficient deployment
Reduce model footprint and computational cost
Apply low-bit quantization to generative AI models
Maintain accuracy and output quality during quantization
Leverage knowledge distillation for efficient multimodal reasoning
Implement pruning techniques to reduce computational overhead
Analyze trade-offs between model efficiency and accuracy
Propose improvements based on empirical findings
Research mixed-precision quantization and advanced compression strategies
Stay current with the latest research in model compression
Document methodologies, experiments, and results clearly
Support reproducibility and internal collaboration
Communicate results to stakeholders
Author technical papers and publish findings
Advance the field of model compression for multimodal AI

Berufserfahrung

ca. 4 - 6 Jahre

Ausbildung

Bachelor-Abschluss

Sprachen

Englisch – verhandlungssicher

Tools & Technologien

PyTorch
model quantization
Quantization-Aware Training (QAT)
Post-Training Quantization (PTQ)
knowledge distillation
model pruning
neural network architectures
transformers
LLMs
VLMs
backpropagation
optimization
fine-tuning
C++

Die Originalanzeige dieses Stellenangebotes in der aktuellsten Version findest du hier. Nejo hat diesen Job automatisch von der Website des Unternehmens Tether Operations Limited erfasst und die Informationen auf Nejo mit Hilfe von KI für dich aufbereitet. Trotz sorgfältiger Analyse können einzelne Informationen unvollständig oder ungenau sein. Bitte prüfe immer alle Angaben in der Originalanzeige! Inhalte und Urheberrechte der Originalanzeige liegen beim ausschreibenden Unternehmen.

Gefällt dir diese Stelle?

Beta

Dein Career Agent findet täglich ähnliche Jobs für dich.

Noch nicht perfekt?

Tether Operations Limited
AI Research Engineer - Kernel & Inference Optimization(m/w/x)
VollzeitRemoteSenior
Zürich
Anthropic
Research Engineer / Research Scientist, Pre-training(m/w/x)
Vollzeitmit HomeofficeBerufserfahren
Zürich
ab CHF 280.000 - 680.000 / Jahr
ANYbotics
Senior AI Research Engineer, Visual Perception(m/w/x)
Vollzeitmit HomeofficeSenior
Zürich
Avaloq
.AI Software Engineer(m/w/x)
Vollzeitmit HomeofficeSenior
Zürich
Mistral
AI Scientist(m/w/x)
Vollzeitmit HomeofficeKeine Angabe
Zürich

Alle 100+ ähnlichen Jobs ansehen

TETether Operations Limited

vor 18 Tagen