Tether Operations Limited
20d ago

Lead AI Inference Engineer (m/f/x)

Lugano
Full-time, Remote, Senior
AI/ML

Description

In this role, you will lead a dynamic team to develop and deploy cutting-edge AI solutions, balancing technical tasks with team management. Your work will involve integrating advanced AI features into products and ensuring their reliable performance across various devices.

Requirements

  • Excellent programming skills in C++
  • Strong experience with the llama.cpp and ggml inference engines
  • Good understanding of deep learning concepts and model architectures
  • Experience with transformers and LLMs
  • Demonstrated ability to rapidly assimilate new technologies and techniques
  • Experience managing a small, specialized, cross-functional team
  • Genuine passion for building good products
  • Degree in Computer Science, AI, Machine Learning, or related field
  • Extensive experience with JavaScript/TypeScript
  • Experience with AWS, containerization platforms, orchestration, and automated testing suites
  • Understanding of peer-to-peer (P2P) technology
  • Experience with MLC, TVM, or similar frameworks
  • Experience with Vulkan and CUDA
  • Experience productionizing models

Education

Bachelor's degree

Work Experience

Approx. 4-6 years

Tasks

  • Lead a cross-functional team in AI development
  • Ensure reliable performance of local AI capabilities across devices
  • Balance hands-on technical work with team coordination
  • Deploy machine learning models to edge devices using frameworks like llama.cpp, ggml, and ONNX
  • Collaborate with researchers to code, train, and transition models to production
  • Integrate AI features into existing products with the latest machine learning advancements
  • Manage a team of middleware, foundation, QA, and documentation engineers
  • Assess market position qualitatively and quantitatively against similar products
  • Leverage technical architects' expertise for robust architectural choices
  • Ensure stable releases by following internal release processes

Tools & Technologies

C++, llama.cpp, ggml, deep learning, transformers, LLMs, AWS, JavaScript, TypeScript, MLC, TVM, Vulkan, CUDA

Languages

English: Business Fluent

Nejo automatically captured this job from the website of Tether Operations Limited and processed the information with the help of AI. Despite careful analysis, some information may be incomplete or inaccurate, so always verify all details in the original posting. Content and copyrights of the original posting belong to the advertising company.