Skip to content
New Job?Nejo!

The AI Job Search Engine

DE
DeepL
25d ago

Senior Research Data Engineer(m/w/x)

Berlin
Full-timeWith Home OfficeSenior
AI/ML
Data Science

Description

You will drive frontier research by building massive data pipelines for foundation models. Your day-to-day involves scaling complex Python solutions to manage petabytes of multimodal data.

Let AI find the perfect jobs for you!

Upload your CV and Nejo AI will find matching job offers for you.

Requirements

  • Degree in scientific or technical field
  • Experience as Data Engineer or similar
  • Extensive experience with Python data ecosystem
  • Skills in EDA and ML feature engineering
  • Developing and deploying data pipelines
  • End-to-end ownership of data solutions
  • Experience with distributed computing and IaC
  • Excellent communication and collaboration skills
  • LLM training data preparation skills
  • Knowledge of NLP and GPU workflows
  • Experience with dynamic workflow orchestration
  • Linguistics expertise or multilingual skills
  • Fluency in C++, Go or Rust

Education

Bachelor's degree

Work Experience

approx. 4 - 6 years

Tasks

  • Collaborate on ambitious frontier research projects
  • Architect and build scalable data pipelines
  • Download and prepare multimodal unstructured data
  • Utilize Kubernetes, Dask, and Ray stacks
  • Debug and fix low-level open-source issues
  • Deploy complex Python solutions to cloud infrastructure
  • Operate data processing at a massive scale
  • Engineer solutions for text, code, and audio
  • Partner with scientists and platform teams
  • Champion data quality and availability standards
  • Ensure mission-critical reliability of pipeline jobs
  • Maintain high-quality code and documentation

Tools & Technologies

PythonDaskCeleryRayKubernetesAWSArgo WorkflowsC++GoRustGPU

Languages

EnglishBusiness Fluent

Benefits

Flexible Working

  • Hybrid work schedule
  • Flexible working hours

Informal Culture

  • Diverse international team
  • Open communication

Career Advancement

  • Regular feedback

Team Events

  • Regular in-person team events
  • Monthly full-day hacking sessions

More Vacation Days

  • 30 days of annual leave

Mental Health Support

  • Mental health resources

Additional Allowances

  • Tailored location-based benefits
Find the original job posting in its most current version here. Nejo automatically captured this job from the website of DeepL and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.
Not a perfect match?
100+ Similar Jobs in Berlin
  • 4flow SE

    Senior Data Engineer(m/w/x)

    Full-timeWith HomeofficeSenior
    Berlin
  • Rasa

    Senior Data Engineer(m/w/x)

    Full-timeRemoteSenior
    Berlin
  • Makersite GmbH

    Data Engineer(m/w/x)

    Full-timeRemoteExperienced
    Berlin
  • Tether

    Senior Research Engineer - Multimodal & Video Foundation Model(m/w/x)

    Full-timeRemoteSenior
    Berlin
  • dda diconium data GmbH

    Senior Data Engineer(m/w/x)

    Full-timeWith HomeofficeManagement
    Berlin
100+ View all similar jobs