Skip to content
New Job?Nejo!

The AI Job Search Engine

ANAnthropic

Research Engineer / Research Scientist, Pre-training(m/w/x)

Zürich
from CHF 280,000 - 680,000 / year
Full-timeWith Home OfficeExperienced
AI/ML
Data Science

Developing multimodal architectures and data processing optimizers for large-scale model training. Expertise in Python, deep learning, and ML accelerators required. Optional equity donation matching and generous parental leave.

Requirements

  • Degree (BA required, MS or PhD preferred) in Computer Science, Machine Learning, or related field
  • Strong software engineering skills; proven track record building complex systems
  • Expertise in Python and deep learning frameworks
  • Experience with high-performance, large-scale ML systems, language modeling
  • Familiarity with ML Accelerators, Kubernetes, large-scale data processing
  • Strong problem-solving skills and results-oriented mindset
  • Excellent communication skills and collaborative work ability
  • Significant software engineering experience
  • Ability to balance research goals with practical engineering constraints
  • Willingness to take on tasks outside job description
  • Enjoyment of pair programming and collaborative work
  • Eagerness to learn about machine learning research
  • Enthusiasm for working in a cohesive team on large-scale AI research
  • Ambitious goals for AI safety and long-term progress
  • At least a Bachelor's degree in a related field or equivalent experience

Tasks

  • Conduct research on model architecture, algorithms, data processing, and optimizers
  • Implement solutions for model architecture, algorithms, data processing, and optimizers
  • Lead small research projects independently
  • Collaborate with team members on larger initiatives
  • Design, run, and analyze scientific experiments
  • Optimize and scale training infrastructure for efficiency and reliability
  • Develop and improve dev tooling for team productivity
  • Contribute to the entire stack, from optimization to model design
  • Optimize throughput of novel attention mechanisms
  • Propose Transformer variants
  • Experimentally compare Transformer variant performance
  • Prepare large-scale datasets for model consumption
  • Scale distributed training jobs to thousands of accelerators
  • Design fault tolerance strategies for training infrastructure
  • Create interactive visualizations of model internals

Work Experience

  • approx. 1 - 4 years

Education

  • Bachelor's degree

Languages

  • EnglishBusiness Fluent

Tools & Technologies

  • Python
  • deep learning frameworks
  • ML systems
  • language modeling
  • ML Accelerators
  • Kubernetes

Benefits

Competitive Pay

  • Competitive compensation

Social Impact

  • Optional equity donation matching

More Vacation Days

  • Generous vacation

Generous Parental Leave

  • Generous parental leave

Flexible Working

  • Flexible working hours

Modern Office

  • Collaborative office space
Find the original job posting in its most current version here. Nejo automatically captured this job from the website of Anthropic and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.

  • Anthropic

    Research Engineer, Production Model Post Training(m/w/x)

    Full-timeWith HomeofficeExperienced
    Zürich
  • Mistral

    AI Scientist(m/w/x)

    Full-timeWith HomeofficeNot specified
    Zürich
  • ANYbotics

    Senior AI Research Engineer, Visual Perception(m/w/x)

    Full-timeWith HomeofficeSenior
    Zürich
  • Sony AI

    Research Intern for Deep Generative Modeling(m/w/x)

    Full-timeInternshipWith Homeoffice
    Schlieren
  • ANYbotics

    Senior AI Research Engineer in Visual Perception(m/w/x)

    Full-timeWith HomeofficeSenior
    Zürich
View all 100+ similar jobs

Nejo is an AI – results may be incomplete or contain mistakes