Skip to content
New Job?Nejo!

The AI Job Search Engine

CE
Cerence Communications Technology (Shanghai) Co., Ltd.
2mo ago

Sr. Principal TTS Researcher(m/w/x)

Ulm
Full-timeOn-siteSenior
AI/ML

Description

In this role, you will be at the forefront of TTS innovation, developing advanced systems and optimizing models while collaborating with a global team. Your expertise in machine learning and speech processing will drive meaningful advancements in voice technology for the automotive industry.

Let AI find the perfect jobs for you!

Upload your CV and Nejo AI will find matching job offers for you.

Requirements

  • 8+ years of hands-on experience in TTS system development
  • Proficiency in C/C++ and Python
  • Strong background in NLP techniques or speech signal processing
  • Experience with linguistic tools and phonetic knowledge
  • Familiarity with transformer-based language models for prosody prediction
  • Deep understanding of autoregressive and non-autoregressive acoustic models
  • Experience optimizing models via quantization, pruning, or knowledge distillation
  • Knowledge of speech codecs and real-time streaming protocols
  • Production experience with ONNX Runtime, TensorRT, or TorchScript
  • Experience with zero-shot, one-shot, or few-shot voice cloning
  • Skilled GPU/TPU cluster and grid user
  • Fluent English
  • Track record of publications in INTERSPEECH, ICASSP, or NeurIPS
  • Track working experience in world-level TTS or AI company
  • Proven ability to lead cross-functional groups
  • Basic knowledge of information security and data privacy requirements
  • Demonstrative knowledge of information security through training programs

Work Experience

8 years

Tasks

  • Develop TTS systems with frontend and backend components
  • Utilize C/C++ and Python for system development
  • Implement ML frameworks like PyTorch and TensorFlow
  • Apply NLP techniques and speech signal processing
  • Use linguistic tools such as Festival
  • Leverage phonetic knowledge for TTS applications
  • Employ transformer-based models for prosody prediction
  • Optimize models through quantization and pruning
  • Utilize speech codecs like Opus and MELP
  • Implement real-time streaming protocols
  • Work with ONNX Runtime, TensorRT, or TorchScript
  • Develop zero-shot, one-shot, or few-shot voice cloning systems
  • Create emotional TTS systems
  • Manage GPU/TPU clusters and grids
  • Publish research in INTERSPEECH, ICASSP, or NeurIPS
  • Lead cross-functional teams to deliver TTS solutions

Tools & Technologies

C/C++PythonPyTorchTensorFlowFestivalONNX RuntimeTensorRTTorchScript

Languages

EnglishNative

Find the original job posting in its most current version here. Nejo automatically captured this job from the website of Cerence Communications Technology (Shanghai) Co., Ltd. and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.
Not a perfect match?
100+ Similar Jobs in Ulm
  • Cerence Inc.

    Senior TTS Research Engineer(m/w/x)

    Full-timeOn-siteSenior
    Ulm
  • Cerence Inc.

    Senior Speech & Language R&D Engineer(m/w/x)

    Full-timeOn-siteSenior
    Ulm
  • Cerence GmbH

    Senior Audio / Software Engineer(m/w/x)

    Full-timeOn-siteSenior
    Ulm
  • Cerence GmbH

    Senior Software Engineer - Android(m/w/x)

    Full-timeOn-siteSenior
    Ulm
  • Cerence GmbH

    Senior Software Engineer(m/w/x)

    Full-timeOn-siteSenior
    Ulm
100+ View all similar jobs