Senior AI Research Engineer, Model Inference
AI Description
You focus on optimizing and implementing advanced inference techniques for language models, collaborating with teams to improve performance and ensure efficient deployment in mobile and edge environments.
Requirements
Approx. 4–6 years
- Proficiency in C++ and GPU kernel programming.
- Proven expertise in GPU acceleration with the Vulkan framework.
- Strong background in quantization and mixed-precision model optimization.