The AI Job Search Engine
Senior Research Data Engineer (m/f/x)
Description
You will drive frontier research by building massive data pipelines for foundation models. Your day-to-day involves scaling complex Python solutions to manage petabytes of multimodal data.
Requirements
- Degree in a scientific or technical field
- Experience as a Data Engineer or in a similar role
- Extensive experience with the Python data ecosystem
- Skills in EDA and ML feature engineering
- Developing and deploying data pipelines
- End-to-end ownership of data solutions
- Experience with distributed computing and IaC
- Excellent communication and collaboration skills
- LLM training data preparation skills
- Knowledge of NLP and GPU workflows
- Experience with dynamic workflow orchestration
- Linguistics expertise or multilingual skills
- Fluency in C++, Go or Rust
Education
Work Experience
approx. 4 - 6 years
Tasks
- Collaborate on ambitious frontier research projects
- Architect and build scalable data pipelines
- Download and prepare multimodal unstructured data
- Utilize Kubernetes, Dask, and Ray stacks
- Debug and fix low-level open-source issues
- Deploy complex Python solutions to cloud infrastructure
- Operate data processing at a massive scale
- Engineer solutions for text, code, and audio
- Partner with scientists and platform teams
- Champion data quality and availability standards
- Ensure mission-critical reliability of pipeline jobs
- Maintain high-quality code and documentation
Tools & Technologies
Languages
English – Business Fluent
Benefits
Flexible Working
- Hybrid work schedule
- Flexible working hours
Informal Culture
- Diverse international team
- Open communication
Career Advancement
- Regular feedback
Team Events
- Regular in-person team events
- Monthly full-day hacking sessions
More Vacation Days
- 30 days of annual leave
Mental Health Support
- Mental health resources
Additional Allowances
- Tailored location-based benefits
About the Company
DeepL
Industry
IT
Description
DeepL is a global communications platform powered by Language AI, focused on breaking down language barriers and improving communication.