Skip to content
New Job?Nejo!

The AI Job Search Engine

TETether

Senior AI Inference Engineer(m/w/x)

Lugano
Full-timeRemoteSenior
AI/ML

Optimizing C++ systems for AI inference runtime at a fintech firm for digital asset tokenization. Strong Llama.cpp and ggml inference engine experience required. Remote work.

Requirements

  • Excellent C++ programming skills
  • Javascript experience (bonus)
  • Strong Llama.cpp and ggml inference engine experience
  • Good deep learning concepts and model architectures understanding
  • Experience with transformers, LLMs, Diffusion models
  • Ability to rapidly assimilate new technologies and techniques
  • Degree in Computer Science, AI, Machine Learning, or related field
  • Solid track record in AI R&D
  • Javascript/Typescript experience
  • Understanding of p2p technology difficulties, nuances, and importance
  • Experience with Vulkan, Metal, or OpenCL
  • Experience productionizing models

Tasks

  • Manage C++ systems for AI inference.
  • Ensure fast, reliable, and predictable model execution.
  • Engineer runtime quality for AI models.
  • Optimize startup behavior and memory pressure.
  • Balance throughput and latency.
  • Ensure long-session stability.
  • Define and evolve core inference abstractions.
  • Deploy machine learning models to edge devices.
  • Utilize llama.cpp, ggml, and onnx frameworks.
  • Collaborate with researchers on model development.
  • Assist with coding and training models.
  • Transition models from research to production.
  • Integrate AI features into existing products.

Work Experience

  • approx. 4 - 6 years

Education

  • Bachelor's degree

Languages

  • EnglishBusiness Fluent

Tools & Technologies

  • C++
  • Javascript
  • Llama.cpp
  • ggml
  • transformers
  • LLMs
  • Diffusion models
  • Typescript
  • p2p technology
  • Vulkan
  • Metal
  • OpenCL

Benefits

Flexible Working

  • Remote work
Find the original job posting in its most current version here. Nejo automatically captured this job from the website of Tether and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.

  • Tether Operations Limited

    Lead AI Inference Engineer(m/w/x)

    Full-timeRemoteSenior
    Lugano
  • lastminute.com

    Head of Data Platform Engineering(m/w/x)

    Full-timeWith HomeofficeManagement
    Chiasso
  • ABB AG

    R&D Senior Engineer Firmware(m/w/x)

    Full-timeWith HomeofficeSenior
    Quartino
  • Jobtome

    Senior Site Reliability Engineer(m/w/x)

    Full-timeRemoteSenior
    Mendrisio
  • Jobtome

    Senior Backend Developer(m/w/x)

    Full-timeRemoteSenior
    Mendrisio
View all 47+ similar jobs

Nejo is an AI – results may be incomplete or contain mistakes