Senior Machine Learning Engineer - Agents Data (m/f/x)
Designing and building data pipelines for agent training from multimodal sources at a fast-growing design platform. Hands-on experience with ML data pipelines and prompt engineering required. Equity packages, inclusive parental leave, flexible leave options.
Requirements
- Strong software engineering skills in Python
- Practical experience with prompt engineering
- Experience with ML data workflows
- Hands-on experience with data pipelines for ML training
- Familiarity with annotation tooling and data collection
- Understanding of ML training requirements
- Experience loading and writing large datasets to/from cloud infrastructure
- Strong communication skills
- Collaborative approach and ownership
- Experience with preference data collection for RLHF
- Familiarity with multimodal data
- Experience building synthetic data generation pipelines
- Background in data quality metrics and monitoring
- Contributions to dataset releases or benchmarks
Tasks
- Design and build data pipelines for agent training
- Collect, filter, deduplicate, format, and version data from multimodal sources
- Develop tools for dataset construction, including annotation workflows and synthetic data generation
- Own data quality by building validation frameworks and monitoring for drift and contamination
- Create evaluation datasets and benchmarks in collaboration with researchers
- Build and maintain infrastructure for efficient data loading, storage, and retrieval
- Collaborate with research scientists to translate research requirements into data specifications
- Document datasets thoroughly, including provenance and intended use cases
- Profile and optimize research code for training and inference efficiency
- Implement comprehensive test coverage for data pipelines and ML workflows
- Elevate codebase quality through code reviews and refactoring
- Contribute to team roadmaps by identifying data bottlenecks and proposing solutions
Work Experience
- approx. 4 - 6 years
Education
- Bachelor's degree OR
- Master's degree
Languages
- English – Business Fluent
Tools & Technologies
- Python
- Ray
- AWS
Benefits
Competitive Pay
- Equity packages
Generous Parental Leave
- Inclusive parental leave policy
Additional Allowances
- Annual Vibe & Thrive allowance
Workation & Sabbatical
- Flexible leave options
About the Company
Canva
Industry
Other
Description
The company is a fast-growing platform that redefines how the world experiences design.
Not a perfect match?
- Canva: Senior Research Engineer - Design Generation (m/f/x)
  Full-time, with home office, Senior, Vienna
- Becton, Dickinson and Company: Senior Machine Learning Engineer (m/f/x)
  Full-time/Part-time, with home office, Senior, Vienna, from 52,136 / year
- Canva: Senior Research Scientist - Reinforcement Learning, MoEs (m/f/x)
  Full-time, with home office, Senior, Vienna
- Sportradar: Senior ML Engineer (m/f/x)
  Full-time, with home office, Management, Vienna
- RHI Magnesita GmbH: Senior Data Scientist (m/f/x)
  Full-time, with home office, Senior, Vienna, from 65,000 / year