Skip to content
New Job?Nejo!

Your personal AI career agent

CECerence Inc.

.ASR Linguistics Intern(m/w/x)

Aachen
Full-timeInternshipOn-site
AI/ML

Normalizing, tokenizing, and standardizing text for automotive AI training. Basic information security knowledge required. Work with 20+ linguists on a rotating project basis.

Requirements

  • Basic knowledge of information security and data privacy
  • Demonstrative knowledge of information security via internal training

Tasks

  • Normalize text for training models
  • Tokenize text by splitting on whitespace
  • Convert numbers, dates, and time expressions to words
  • Standardize spelling and minor text rewrites
  • Perform inverse normalization for user-friendly output
  • Format dates from 'first of July nineteen ninety two' to 'July 1, 1992'
  • Develop and maintain grammar rules for tokenizer and formatter
  • Handle hybrid systems with varying input formats
  • Support multilingual systems with 3+ languages
  • Process phenomena like numerals, time expressions, and addresses
  • Evaluate formatting accuracy against regression tests
  • Assess system's ability to handle non-standard input
  • Determine training data requirements for accurate formatting
  • Test multilingual system performance
  • Explore using LLMs to enhance grammar rules
  • Divide system into concept identification and formatting components
  • Evaluate LLMs and grammar rules for actual formatting tasks

Education

  • Currently in higher educationOR
  • Bachelor's degreeOR
  • Master's degree

Languages

  • EnglishBusiness Fluent
Find the original job posting in its most current version here. Nejo automatically captured this job from the website of Cerence Inc. and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.

Like this job?

Beta

Your Career Agent finds similar jobs for you every day.


  • Cerence GmbH

    Senior Speech & Language R&D Engineer(m/w/x)

    Full-timeOn-siteSenior
    Aachen
  • Cerence GmbH

    Senior Research Scientist(m/w/x)

    Full-timeOn-siteSenior
    Aachen
  • Cerence GmbH

    Senior Software Engineer – SW Integrator(m/w/x)

    Full-timeOn-siteSenior
    Aachen
  • Fraunhofer-Gesellschaft

    BT/MT: Deep learning and machine learning in production(m/w/x)

    Full-timeWorking StudentOn-site
    Aachen
  • FEV Consulting

    Intern Cost and Value Management(m/w/x)

    Full-timeInternshipOn-site
    Aachen
View all 100+ similar jobs

Nejo is an AI – results may be incomplete or contain mistakes