Senior AI Inference Engineer (m/f/x)
Optimize C++ systems for the AI inference runtime at a fintech firm focused on digital asset tokenization. Strong experience with the llama.cpp and ggml inference engines is required. Remote work.
Requirements
- Excellent C++ programming skills
- JavaScript experience (bonus)
- Strong experience with the llama.cpp and ggml inference engines
- Good understanding of deep learning concepts and model architectures
- Experience with transformers, LLMs, and diffusion models
- Ability to rapidly assimilate new technologies and techniques
- Degree in Computer Science, AI, Machine Learning, or related field
- Solid track record in AI R&D
- JavaScript/TypeScript experience
- Understanding of the difficulties, nuances, and importance of P2P technology
- Experience with Vulkan, Metal, or OpenCL
- Experience productionizing models
Responsibilities
- Manage C++ systems for AI inference.
- Ensure fast, reliable, and predictable model execution.
- Engineer runtime quality for AI models.
- Optimize startup behavior and memory pressure.
- Balance throughput and latency.
- Ensure long-session stability.
- Define and evolve core inference abstractions.
- Deploy machine learning models to edge devices.
- Use the llama.cpp, ggml, and ONNX frameworks.
- Collaborate with researchers on model development.
- Assist with coding and training models.
- Transition models from research to production.
- Integrate AI features into existing products.
Work experience
- approx. 4-6 years
Education
- Bachelor's degree
Languages
- English – business fluent
Tools & Technologies
- C++
- JavaScript
- llama.cpp
- ggml
- transformers
- LLMs
- Diffusion models
- TypeScript
- P2P technology
- Vulkan
- Metal
- OpenCL
Benefits
Flexible working
- Remote work
About the company
Tether
Industry
Financial services
Description
Tether pioneers a global financial revolution with cutting-edge solutions for businesses, enabling seamless integration of reserve-backed tokens across blockchains for instant, secure, and global digital token transactions.