Die KI-Suchmaschine für Jobs
Senior Research Scientist - Reinforcement Learning, MoEs(m/w/x)
Developing multimodal agentic systems, scaling reinforcement learning across distributed systems for a design platform. Expertise in LLMs, VLMs, and Diffusion models, with strong experimental design skills, required. Equity packages, annual Vibe & Thrive allowance, flexible leave.
Anforderungen
- Expertise in MoEs, LLMs, VLMs, and Diffusion
- Experience modifying and adapting open-source models
- Strong experimental design and reproducibility skills
- Fluency in Python and PyTorch
- Experience building and evaluating agent loops
- Hands-on experience with policy optimization
- Experience with large-scale distributed training
- Experience with RL for MoE architectures
- Experience with video and audio modelling
- Experience with multi-agent settings
- Strength in alignment and safety evaluations
- Contributions to open-source or benchmarks
Aufgaben
- Develop multimodal agentic systems
- Design reward models and learning loops
- Scale reinforcement learning across distributed systems
- Optimize mixture-of-experts architectures
- Build scalable training and evaluation loops
- Construct datasets for post-training
- Design experiments for policy optimization
- Develop simulation and sandbox tasks
- Identify failure modes like hallucinations
- Implement offline suites and A/B tests
- Collaborate with product and platform teams
- Mentor teammates and share research findings
Berufserfahrung
- ca. 4 - 6 Jahre
Ausbildung
- Master-Abschluss
Sprachen
- Englisch – verhandlungssicher
Tools & Technologien
- MoEs
- LLMs
- VLMs
- Diffusion models
- Python
- PyTorch
- RLHF
- RLAIF
- DPO
- IPO
- actor-critic
- PPO
- offline RL
Benefits
Attraktive Vergütung
- Equity packages
Großzügige Elternzeit
- Inclusive parental leave policy
Sonstige Zulagen
- Annual Vibe & Thrive allowance
Workation & Sabbatical
- Flexible leave options
Noch nicht perfekt?
- CanvaVollzeitRemoteSeniorWien
- Canva
Senior Research Engineer - Design Generation(m/w/x)
Vollzeitmit HomeofficeSeniorWien - Canva
Machine Learning Engineering Manager - Evaluations(m/w/x)
Vollzeitmit HomeofficeManagementWien - Sportradar
Senior ML Engineer(m/w/x)
Vollzeitmit HomeofficeManagementWien - craftworks GmbH
Senior Data Scientist - Focus Industrial AI(m/w/x)
Vollzeitmit HomeofficeSeniorWienab 55.000 / Jahr
Senior Research Scientist - Reinforcement Learning, MoEs(m/w/x)
Developing multimodal agentic systems, scaling reinforcement learning across distributed systems for a design platform. Expertise in LLMs, VLMs, and Diffusion models, with strong experimental design skills, required. Equity packages, annual Vibe & Thrive allowance, flexible leave.
Anforderungen
- Expertise in MoEs, LLMs, VLMs, and Diffusion
- Experience modifying and adapting open-source models
- Strong experimental design and reproducibility skills
- Fluency in Python and PyTorch
- Experience building and evaluating agent loops
- Hands-on experience with policy optimization
- Experience with large-scale distributed training
- Experience with RL for MoE architectures
- Experience with video and audio modelling
- Experience with multi-agent settings
- Strength in alignment and safety evaluations
- Contributions to open-source or benchmarks
Aufgaben
- Develop multimodal agentic systems
- Design reward models and learning loops
- Scale reinforcement learning across distributed systems
- Optimize mixture-of-experts architectures
- Build scalable training and evaluation loops
- Construct datasets for post-training
- Design experiments for policy optimization
- Develop simulation and sandbox tasks
- Identify failure modes like hallucinations
- Implement offline suites and A/B tests
- Collaborate with product and platform teams
- Mentor teammates and share research findings
Berufserfahrung
- ca. 4 - 6 Jahre
Ausbildung
- Master-Abschluss
Sprachen
- Englisch – verhandlungssicher
Tools & Technologien
- MoEs
- LLMs
- VLMs
- Diffusion models
- Python
- PyTorch
- RLHF
- RLAIF
- DPO
- IPO
- actor-critic
- PPO
- offline RL
Benefits
Attraktive Vergütung
- Equity packages
Großzügige Elternzeit
- Inclusive parental leave policy
Sonstige Zulagen
- Annual Vibe & Thrive allowance
Workation & Sabbatical
- Flexible leave options
Über das Unternehmen
Canva
Branche
IT
Beschreibung
The company is a fast-growing platform that redefines how the world experiences design.
Noch nicht perfekt?
- Canva
Senior Machine Learning Engineer - Agents data(m/w/x)
VollzeitRemoteSeniorWien - Canva
Senior Research Engineer - Design Generation(m/w/x)
Vollzeitmit HomeofficeSeniorWien - Canva
Machine Learning Engineering Manager - Evaluations(m/w/x)
Vollzeitmit HomeofficeManagementWien - Sportradar
Senior ML Engineer(m/w/x)
Vollzeitmit HomeofficeManagementWien - craftworks GmbH
Senior Data Scientist - Focus Industrial AI(m/w/x)
Vollzeitmit HomeofficeSeniorWienab 55.000 / Jahr