Skip to content
New Job?Nejo!

The AI Job Search Engine

TETether Operations Limited

Lead AI Inference Engineer(m/w/x)

Lugano
Full-timeRemoteSenior
AI/ML

Developing and optimizing C++ inference backbone for AI models in blockchain financial solutions. Strong Llama.cpp and ggml inference engines experience required. Remote work model.

Requirements

  • Excellent C++ programming skills
  • Strong Llama.cpp and ggml inference engines experience
  • Good understanding of deep learning concepts and architectures
  • Experience with transformers, LLMs, Diffusion Models
  • Ability to rapidly assimilate new technologies and techniques
  • Experience managing small cross-functional teams (3-5 people)
  • Passion for building products that improve lives
  • Degree in Computer Science, AI, Machine Learning, or related field
  • Solid track record in AI R&D
  • Extensive Javascript/Typescript experience
  • Understanding of p2p technology nuances and importance
  • Experience with Vulkan, Metal, or OpenCL
  • Experience productionizing models

Tasks

  • Develop and maintain the C++ inference backbone.
  • Optimize model performance for real user hardware.
  • Engineer runtime quality for AI models.
  • Optimize startup behavior and memory pressure.
  • Balance throughput and latency.
  • Ensure long-session stability.
  • Deploy ML models to edge devices (llama.cpp, ggml, onnx).
  • Collaborate with researchers on model coding and training.
  • Transition models from research to production environments.
  • Integrate AI features into existing products.
  • Enrich products with machine learning advancements.
  • Manage a cross-functional engineering team.
  • Lead middleware, C++, QA, and documentation teams.
  • Produce high-quality deliverables.
  • Assess market position of similar products.
  • Perform qualitative and quantitative market analysis.
  • Leverage technical architect expertise.
  • Ensure robust architectural choices.
  • Maintain high code quality.
  • Ensure stable product releases.
  • Follow precise internal release processes.

Work Experience

  • approx. 4 - 6 years

Education

  • Bachelor's degree

Languages

  • EnglishBusiness Fluent

Tools & Technologies

  • C++
  • Llama.cpp
  • ggml
  • transformers
  • LLMs
  • Diffusion Models
  • Javascript
  • Typescript
  • p2p technology
  • Vulkan
  • Metal
  • OpenCL

Benefits

Flexible Working

  • Remote work
Find the original job posting in its most current version here. Nejo automatically captured this job from the website of Tether Operations Limited and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.

  • Tether

    Senior AI Inference Engineer(m/w/x)

    Full-timeRemoteSenior
    Lugano
  • lastminute.com

    Head of Data Platform Engineering(m/w/x)

    Full-timeWith HomeofficeManagement
    Chiasso
  • Jobtome

    Senior Front-end Developer(m/w/x)

    Full-timeRemoteSenior
    Mendrisio
  • Jobtome

    Senior Backend Developer(m/w/x)

    Full-timeRemoteSenior
    Mendrisio
  • ABB AG

    R&D Senior Engineer Firmware(m/w/x)

    Full-timeWith HomeofficeSenior
    Quartino
View all 47+ similar jobs

Nejo is an AI – results may be incomplete or contain mistakes