Skip to content
Neuer Job?Nejo!

Dein persönlicher KI-Karriere-Agent

TETether Operations Limited

.AI Research Engineer (Model Compression & Quantization)(m/w/x)

Zürich
VollzeitRemoteSenior
AI/ML

Applying low-bit quantization to generative AI models for financial token integration. PhD in NLP/ML and A* publications required. Focus on model compression and efficient deployment.

Anforderungen

  • Degree in Computer Science or related field
  • PhD in NLP, Machine Learning, or related field
  • Solid track record in AI R&D with A* publications
  • Experience with PyTorch or equivalent frameworks
  • Hands-on experience with model quantization (QAT and PTQ)
  • Research and hands-on experience with knowledge distillation
  • Research and hands-on experience with model pruning
  • Solid understanding of neural network architectures and training
  • Understanding of transformers (LLMs, VLMs), backpropagation, optimization, fine-tuning
  • Familiarity with C++ (advantageous)

Aufgaben

  • Drive innovation in model compression and efficient deployment
  • Reduce model footprint and computational cost
  • Apply low-bit quantization to generative AI models
  • Maintain accuracy and output quality during quantization
  • Leverage knowledge distillation for efficient multimodal reasoning
  • Implement pruning techniques to reduce computational overhead
  • Analyze trade-offs between model efficiency and accuracy
  • Propose improvements based on empirical findings
  • Research mixed-precision quantization and advanced compression strategies
  • Stay current with the latest research in model compression
  • Document methodologies, experiments, and results clearly
  • Support reproducibility and internal collaboration
  • Communicate results to stakeholders
  • Author technical papers and publish findings
  • Advance the field of model compression for multimodal AI

Berufserfahrung

  • ca. 4 - 6 Jahre

Ausbildung

  • Bachelor-Abschluss

Sprachen

  • Englischverhandlungssicher

Tools & Technologien

  • PyTorch
  • model quantization
  • Quantization-Aware Training (QAT)
  • Post-Training Quantization (PTQ)
  • knowledge distillation
  • model pruning
  • neural network architectures
  • transformers
  • LLMs
  • VLMs
  • backpropagation
  • optimization
  • fine-tuning
  • C++
Die Originalanzeige dieses Stellenangebotes in der aktuellsten Version findest du hier. Nejo hat diesen Job automatisch von der Website des Unternehmens Tether Operations Limited erfasst und die Informationen auf Nejo mit Hilfe von KI für dich aufbereitet. Trotz sorgfältiger Analyse können einzelne Informationen unvollständig oder ungenau sein. Bitte prüfe immer alle Angaben in der Originalanzeige! Inhalte und Urheberrechte der Originalanzeige liegen beim ausschreibenden Unternehmen.

Gefällt dir diese Stelle?

Beta

Dein Career Agent findet täglich ähnliche Jobs für dich.


  • Tether Operations Limited

    AI Research Engineer - Kernel & Inference Optimization(m/w/x)

    VollzeitRemoteSenior
    Zürich
  • Anthropic

    Research Engineer / Research Scientist, Pre-training(m/w/x)

    Vollzeitmit HomeofficeBerufserfahren
    Zürich
    ab CHF 280.000 - 680.000 / Jahr
  • ANYbotics

    Senior AI Research Engineer, Visual Perception(m/w/x)

    Vollzeitmit HomeofficeSenior
    Zürich
  • Avaloq

    .AI Software Engineer(m/w/x)

    Vollzeitmit HomeofficeSenior
    Zürich
  • Mistral

    AI Scientist(m/w/x)

    Vollzeitmit HomeofficeKeine Angabe
    Zürich
Alle 100+ ähnlichen Jobs ansehen