The AI Job Search Engine
Senior Research Scientist - Reinforcement Learning, MoEs(m/w/x)
Developing multimodal agentic systems, scaling reinforcement learning across distributed systems for a design platform. Expertise in LLMs, VLMs, and Diffusion models, with strong experimental design skills, required. Equity packages, annual Vibe & Thrive allowance, flexible leave.
Requirements
- Expertise in MoEs, LLMs, VLMs, and Diffusion
- Experience modifying and adapting open-source models
- Strong experimental design and reproducibility skills
- Fluency in Python and PyTorch
- Experience building and evaluating agent loops
- Hands-on experience with policy optimization
- Experience with large-scale distributed training
- Experience with RL for MoE architectures
- Experience with video and audio modelling
- Experience with multi-agent settings
- Strength in alignment and safety evaluations
- Contributions to open-source or benchmarks
Tasks
- Develop multimodal agentic systems
- Design reward models and learning loops
- Scale reinforcement learning across distributed systems
- Optimize mixture-of-experts architectures
- Build scalable training and evaluation loops
- Construct datasets for post-training
- Design experiments for policy optimization
- Develop simulation and sandbox tasks
- Identify failure modes like hallucinations
- Implement offline suites and A/B tests
- Collaborate with product and platform teams
- Mentor teammates and share research findings
Work Experience
- approx. 4 - 6 years
Education
- Master's degree
Languages
- English – Business Fluent
Tools & Technologies
- MoEs
- LLMs
- VLMs
- Diffusion models
- Python
- PyTorch
- RLHF
- RLAIF
- DPO
- IPO
- actor-critic
- PPO
- offline RL
Benefits
Competitive Pay
- Equity packages
Generous Parental Leave
- Inclusive parental leave policy
Additional Allowances
- Annual Vibe & Thrive allowance
Workation & Sabbatical
- Flexible leave options
Not a perfect match?
- CanvaFull-timeRemoteSeniorWien
- Canva
Senior Research Engineer - Design Generation(m/w/x)
Full-timeWith HomeofficeSeniorWien - Canva
Machine Learning Engineering Manager - Evaluations(m/w/x)
Full-timeWith HomeofficeManagementWien - Sportradar
Senior ML Engineer(m/w/x)
Full-timeWith HomeofficeManagementWien - craftworks GmbH
Senior Data Scientist - Focus Industrial AI(m/w/x)
Full-timeWith HomeofficeSeniorWienfrom 55,000 / year
Senior Research Scientist - Reinforcement Learning, MoEs(m/w/x)
Developing multimodal agentic systems, scaling reinforcement learning across distributed systems for a design platform. Expertise in LLMs, VLMs, and Diffusion models, with strong experimental design skills, required. Equity packages, annual Vibe & Thrive allowance, flexible leave.
Requirements
- Expertise in MoEs, LLMs, VLMs, and Diffusion
- Experience modifying and adapting open-source models
- Strong experimental design and reproducibility skills
- Fluency in Python and PyTorch
- Experience building and evaluating agent loops
- Hands-on experience with policy optimization
- Experience with large-scale distributed training
- Experience with RL for MoE architectures
- Experience with video and audio modelling
- Experience with multi-agent settings
- Strength in alignment and safety evaluations
- Contributions to open-source or benchmarks
Tasks
- Develop multimodal agentic systems
- Design reward models and learning loops
- Scale reinforcement learning across distributed systems
- Optimize mixture-of-experts architectures
- Build scalable training and evaluation loops
- Construct datasets for post-training
- Design experiments for policy optimization
- Develop simulation and sandbox tasks
- Identify failure modes like hallucinations
- Implement offline suites and A/B tests
- Collaborate with product and platform teams
- Mentor teammates and share research findings
Work Experience
- approx. 4 - 6 years
Education
- Master's degree
Languages
- English – Business Fluent
Tools & Technologies
- MoEs
- LLMs
- VLMs
- Diffusion models
- Python
- PyTorch
- RLHF
- RLAIF
- DPO
- IPO
- actor-critic
- PPO
- offline RL
Benefits
Competitive Pay
- Equity packages
Generous Parental Leave
- Inclusive parental leave policy
Additional Allowances
- Annual Vibe & Thrive allowance
Workation & Sabbatical
- Flexible leave options
About the Company
Canva
Industry
IT
Description
The company is a fast-growing platform that redefines how the world experiences design.
Not a perfect match?
- Canva
Senior Machine Learning Engineer - Agents data(m/w/x)
Full-timeRemoteSeniorWien - Canva
Senior Research Engineer - Design Generation(m/w/x)
Full-timeWith HomeofficeSeniorWien - Canva
Machine Learning Engineering Manager - Evaluations(m/w/x)
Full-timeWith HomeofficeManagementWien - Sportradar
Senior ML Engineer(m/w/x)
Full-timeWith HomeofficeManagementWien - craftworks GmbH
Senior Data Scientist - Focus Industrial AI(m/w/x)
Full-timeWith HomeofficeSeniorWienfrom 55,000 / year