The AI Job Search Engine
Senior AI Inference Engineer - llama.cpp specialist (m/f/x)
Description
In this role, you will focus on optimizing AI inference engines for edge devices, collaborating with researchers to transition models into production, and enhancing existing products with cutting-edge machine learning features.
Requirements
- Excellent programming skills in C++
- Experience in JavaScript
- Strong experience with the llama.cpp and ggml inference engines
- Good understanding of deep learning concepts and model architectures
- Experience with transformers and LLMs
- Demonstrated ability to rapidly assimilate new technologies and techniques
- Degree in Computer Science, AI, Machine Learning, or a related field
- Solid track record in AI R&D
Education
Work Experience
approx. 4 - 6 years
Tasks
- Work on the C++ layer for local AI
- Port and enhance inference engines like llama.cpp and ONNX
- Optimize models for faster loading and a leaner resource footprint
- Ensure stability and optimization of the inference layer
- Deploy machine learning models to edge devices
- Collaborate with researchers on coding and training models
- Transition models from research to production environments
- Integrate AI features into existing products
Tools & Technologies
Languages
English – Business Fluent
About the Company
Tether Operations Limited
Industry
Financial Services
Description
The company pioneers a global financial revolution with blockchain solutions, enabling secure and instant digital token transactions.