Senior AI Inference Engineer - llama.cpp specialist (m/f/x)
Description
In this role, you will focus on optimizing AI inference engines for edge devices, collaborating with researchers to transition models into production, and enhancing existing products with cutting-edge machine learning features.
Requirements
- Excellent programming skills in C++
- Experience in JavaScript
- Strong experience with the llama.cpp and ggml inference engines
- Good understanding of deep learning concepts and model architectures
- Experience with transformers and LLMs
- Demonstrated ability to rapidly assimilate new technologies and techniques
- Degree in Computer Science, AI, Machine Learning, or a related field
- Solid track record in AI R&D
Work experience
approx. 4-6 years
Responsibilities
- Work on the C++ layer for local AI
- Port and enhance inference engines like llama.cpp and ONNX
- Optimize models for faster loading and leaner performance
- Ensure stability and optimization of the inference layer
- Deploy machine learning models to edge devices
- Collaborate with researchers on coding and training models
- Transition models from research to production environments
- Integrate AI features into existing products
Languages
English (business fluent)
About the company
Tether Operations Limited
Industry
Financial Services
Description
The company pioneers a global financial revolution with blockchain solutions, enabling secure and instant digital token transactions.
- Lakera: Senior AI Engineer (m/f/x). Full-time, remote, senior, Zürich
- Tether Operations Limited: Middleware Engineer - Fullstack (m/f/x). Full-time, remote, senior, Zürich
- Speechify: Senior Software Engineer, AI Model Serving (m/f/x). Full-time, remote, senior, Zürich
- Mistral: AI Scientist (m/f/x). Full-time, home office possible, seniority not specified, Zürich
- Caplena: AI Engineer (m/f/x). Full-time, home office possible, experienced, Zürich