Canva

Senior Research Scientist - Reinforcement Learning, MoEs (m/w/x)

Wien
Full-time, With Home Office, Senior
AI/ML
Data Science

Develop multimodal agentic systems and scale reinforcement learning across distributed systems for a design platform. The role requires expertise in LLMs, VLMs, and diffusion models, along with strong experimental design skills. Benefits include equity packages, an annual Vibe & Thrive allowance, and flexible leave.

Requirements

  • Expertise in MoEs, LLMs, VLMs, and diffusion models
  • Experience modifying and adapting open-source models
  • Strong experimental design and reproducibility skills
  • Fluency in Python and PyTorch
  • Experience building and evaluating agent loops
  • Hands-on experience with policy optimization
  • Experience with large-scale distributed training
  • Experience with RL for MoE architectures
  • Experience with video and audio modelling
  • Experience with multi-agent settings
  • Strength in alignment and safety evaluations
  • Contributions to open-source or benchmarks

Tasks

  • Develop multimodal agentic systems
  • Design reward models and learning loops
  • Scale reinforcement learning across distributed systems
  • Optimize mixture-of-experts architectures
  • Build scalable training and evaluation loops
  • Construct datasets for post-training
  • Design experiments for policy optimization
  • Develop simulation and sandbox tasks
  • Identify failure modes like hallucinations
  • Implement offline suites and A/B tests
  • Collaborate with product and platform teams
  • Mentor teammates and share research findings

Work Experience

  • approx. 4 - 6 years

Education

  • Master's degree

Languages

  • English: Business Fluent

Tools & Technologies

  • MoEs
  • LLMs
  • VLMs
  • Diffusion models
  • Python
  • PyTorch
  • RLHF
  • RLAIF
  • DPO
  • IPO
  • actor-critic
  • PPO
  • offline RL

Benefits

Competitive Pay

  • Equity packages

Generous Parental Leave

  • Inclusive parental leave policy

Additional Allowances

  • Annual Vibe & Thrive allowance

Workation & Sabbatical

  • Flexible leave options

Nejo automatically captured this job from the website of Canva and processed the information with the help of AI. Despite careful analysis, some information may be incomplete or inaccurate, so always verify all details in the original posting. Content and copyrights of the original posting belong to the advertising company.
