Your personal AI career agent
ML Data Engineer(m/w/x)
Audio-to-text pipeline setup and speaker diarization for mental health app data, including ASR model evaluation. 2-4 years data pipeline experience required. 30 days vacation, sponsored lunch, VIP treatment in Schön Klinik's portfolio.
Requirements
- 2–4 years experience with data pipelines
- Experience with Python
- Ideally experience with Go
- Advantageous experience with audio/speech processing
- Solid SQL knowledge
- Solid NoSQL knowledge
- Solid cloud infrastructure knowledge (AWS/GCP/Azure)
- Pragmatic, hands-on work approach
- Ability to build solutions independently
- Fluent English skills
- German skills (plus)
- Nice-to-have experience with MLOps tools
- Nice-to-have experience with Airflow
- Nice-to-have experience with Prefect
- Nice-to-have experience with DVC
Tasks
- Set up and operate the audio-to-text pipeline.
- Evaluate ASR models (Whisper, AssemblyAI, specialized providers).
- Perform speaker diarization for therapist/patient separation.
- Automate data ingestion, transcription, quality checks, and storage.
- Integrate and support annotation tools (Label Studio, Prodigy).
- Manage database architecture for transcripts and metadata.
- Monitor data quality and handle errors.
- Interface with vAI platform for RAG data delivery.
Work Experience
- 2 - 4 years
Education
- Vocational certificationOR
- Bachelor's degreeOR
- Master's degree
Languages
- English – Business Fluent
- German – Business Fluent
Tools & Technologies
- Python
- Go
- SQL
- NoSQL
- AWS
- GCP
- Azure
- Airflow
- Prefect
- DVC
Benefits
More Vacation Days
- 30 days of vacation
Free or Subsidized Food
- Sponsored lunch
Other Benefits
- VIP treatment in Schön Klinik's portfolio
Retirement Plans
- Pension plan
Childcare
- Child care
Company Bike
- Company bike lease options
Corporate Discounts
- Employee discounts with more than 600 brands
Healthcare & Fitness
- EGYM Wellpass
Not a perfect match?
- Data4LifeFull-timeWith HomeofficeExperiencedBerlin, Potsdam
- Statista
Data Engineer - Healthcare(m/w/x)
Full-timeWith HomeofficeExperiencedHamburg, Berlin - InterWorks
Data Engineer(m/w/x)
Full-timeRemoteExperiencedBerlinfrom 100,000 / year - LiveEO GmbH
Senior Data Engineer - Remote Sensing & AI Pipelines(m/w/x)
Full-timeWith HomeofficeSeniorBerlin - Clue
(Senior) ML Engineer(m/w/x)
Full-timeWith HomeofficeSeniorBerlin
ML Data Engineer(m/w/x)
Audio-to-text pipeline setup and speaker diarization for mental health app data, including ASR model evaluation. 2-4 years data pipeline experience required. 30 days vacation, sponsored lunch, VIP treatment in Schön Klinik's portfolio.
Requirements
- 2–4 years experience with data pipelines
- Experience with Python
- Ideally experience with Go
- Advantageous experience with audio/speech processing
- Solid SQL knowledge
- Solid NoSQL knowledge
- Solid cloud infrastructure knowledge (AWS/GCP/Azure)
- Pragmatic, hands-on work approach
- Ability to build solutions independently
- Fluent English skills
- German skills (plus)
- Nice-to-have experience with MLOps tools
- Nice-to-have experience with Airflow
- Nice-to-have experience with Prefect
- Nice-to-have experience with DVC
Tasks
- Set up and operate the audio-to-text pipeline.
- Evaluate ASR models (Whisper, AssemblyAI, specialized providers).
- Perform speaker diarization for therapist/patient separation.
- Automate data ingestion, transcription, quality checks, and storage.
- Integrate and support annotation tools (Label Studio, Prodigy).
- Manage database architecture for transcripts and metadata.
- Monitor data quality and handle errors.
- Interface with vAI platform for RAG data delivery.
Work Experience
- 2 - 4 years
Education
- Vocational certificationOR
- Bachelor's degreeOR
- Master's degree
Languages
- English – Business Fluent
- German – Business Fluent
Tools & Technologies
- Python
- Go
- SQL
- NoSQL
- AWS
- GCP
- Azure
- Airflow
- Prefect
- DVC
Benefits
More Vacation Days
- 30 days of vacation
Free or Subsidized Food
- Sponsored lunch
Other Benefits
- VIP treatment in Schön Klinik's portfolio
Retirement Plans
- Pension plan
Childcare
- Child care
Company Bike
- Company bike lease options
Corporate Discounts
- Employee discounts with more than 600 brands
Healthcare & Fitness
- EGYM Wellpass
About the Company
MindDoc
Industry
Healthcare
Description
Das Unternehmen hat sich zu Deutschlands führendem Anbieter von videobasierter Psychotherapie entwickelt und bietet eine App zur mentalen Gesundheit an.
Not a perfect match?
- Data4Life
Data Engineer(m/w/x)
Full-timeWith HomeofficeExperiencedBerlin, Potsdam - Statista
Data Engineer - Healthcare(m/w/x)
Full-timeWith HomeofficeExperiencedHamburg, Berlin - InterWorks
Data Engineer(m/w/x)
Full-timeRemoteExperiencedBerlinfrom 100,000 / year - LiveEO GmbH
Senior Data Engineer - Remote Sensing & AI Pipelines(m/w/x)
Full-timeWith HomeofficeSeniorBerlin - Clue
(Senior) ML Engineer(m/w/x)
Full-timeWith HomeofficeSeniorBerlin