The AI Job Search Engine
Lead AI Inference Engineer (m/f/x)
Description
In this role, you will lead a dynamic team to develop and deploy cutting-edge AI solutions, balancing technical tasks with team management. Your work will involve integrating advanced AI features into products and ensuring their reliable performance across various devices.
Requirements
- Excellent programming skills in C++
- Strong experience with the llama.cpp and ggml inference engines
- Good understanding of deep learning concepts and model architectures
- Experience with transformers and LLMs
- Demonstrated ability to rapidly assimilate new technologies and techniques
- Experience managing a small, specialized, cross-functional team
- Genuine passion for building good products
- Degree in Computer Science, AI, Machine Learning, or a related field
- Extensive experience with JavaScript/TypeScript
- Experience with AWS, containerization platforms, orchestration, and automated testing suites
- Understanding of P2P technology
- Experience with MLC, TVM, or similar frameworks
- Experience with Vulkan and CUDA
- Experience productionizing models
Education
Work Experience
approx. 4 - 6 years
Tasks
- Lead a cross-functional team in AI development
- Ensure reliable performance of local AI capabilities across devices
- Balance hands-on technical work with team coordination
- Deploy machine learning models to edge devices using frameworks such as llama.cpp, ggml, and ONNX
- Collaborate with researchers to code, train, and transition models to production
- Integrate AI features into existing products, drawing on the latest machine learning advancements
- Manage a team of middleware, foundation, QA, and documentation engineers
- Assess market position qualitatively and quantitatively against similar products
- Leverage technical architects' expertise to make robust architectural choices
- Ensure stable releases by following internal release processes
Tools & Technologies
Languages
English – Business Fluent
About the Company
Tether Operations Limited
Industry
Financial Services
Description
The company pioneers a global financial revolution with blockchain solutions, enabling secure and instant digital token transactions.