Optimize C++ systems for the AI inference runtime at a fintech firm focused on digital asset tokenization. Strong llama.cpp and ggml inference engine experience required. Remote work.
Requirements
- Excellent C++ programming skills
- JavaScript experience (bonus)
- Strong experience with the llama.cpp and ggml inference engines
- Good understanding of deep learning concepts and model architectures
- Experience with transformers, LLMs, and diffusion models
- Ability to rapidly assimilate new technologies and techniques
- Degree in Computer Science, AI, Machine Learning, or related field
- Solid track record in AI R&D
- JavaScript/TypeScript experience
- Understanding of the difficulties, nuances, and importance of p2p technology
- Experience with Vulkan, Metal, or OpenCL
- Experience productionizing models
Tasks
- Manage C++ systems for AI inference.
- Ensure fast, reliable, and predictable model execution.
- Engineer runtime quality for AI models.
- Optimize startup behavior and memory pressure.
- Balance throughput and latency.
- Ensure long-session stability.
- Define and evolve core inference abstractions.
- Deploy machine learning models to edge devices.
- Use the llama.cpp, ggml, and ONNX frameworks.
- Collaborate with researchers on model development.
- Assist with coding and training models.
- Transition models from research to production.
- Integrate AI features into existing products.
Work Experience
- approx. 4 to 6 years
Education
- Bachelor's degree
Languages
- English – Business Fluent
Tools & Technologies
- C++
- JavaScript
- llama.cpp
- ggml
- transformers
- LLMs
- Diffusion models
- TypeScript
- p2p technology
- Vulkan
- Metal
- OpenCL
Benefits
Flexible Working
- Remote work
About the Company
Tether
Industry
Financial Services
Description
Tether pioneers a global financial revolution with cutting-edge solutions for businesses, enabling seamless integration of reserve-backed tokens across blockchains for instant, secure, and global digital token transactions.
Position
- Lead AI Inference Engineer (m/f/x)
- Tether Operations Limited
- Full-time, Remote, Senior, Lugano