Your personal AI career agent
Research Engineer / Research Scientist, Pre-training(m/w/x)
Developing multimodal architectures and data processing optimizers for large-scale model training. Expertise in Python, deep learning, and ML accelerators required. Optional equity donation matching and generous parental leave.
Requirements
- Degree (BA required, MS or PhD preferred) in Computer Science, Machine Learning, or related field
- Strong software engineering skills; proven track record building complex systems
- Expertise in Python and deep learning frameworks
- Experience with high-performance, large-scale ML systems, language modeling
- Familiarity with ML Accelerators, Kubernetes, large-scale data processing
- Strong problem-solving skills and results-oriented mindset
- Excellent communication skills and collaborative work ability
- Significant software engineering experience
- Ability to balance research goals with practical engineering constraints
- Willingness to take on tasks outside job description
- Enjoyment of pair programming and collaborative work
- Eagerness to learn about machine learning research
- Enthusiasm for working in a cohesive team on large-scale AI research
- Ambitious goals for AI safety and long-term progress
- At least a Bachelor's degree in a related field or equivalent experience
Tasks
- Conduct research on model architecture, algorithms, data processing, and optimizers
- Implement solutions for model architecture, algorithms, data processing, and optimizers
- Lead small research projects independently
- Collaborate with team members on larger initiatives
- Design, run, and analyze scientific experiments
- Optimize and scale training infrastructure for efficiency and reliability
- Develop and improve dev tooling for team productivity
- Contribute to the entire stack, from optimization to model design
- Optimize throughput of novel attention mechanisms
- Propose Transformer variants
- Experimentally compare Transformer variant performance
- Prepare large-scale datasets for model consumption
- Scale distributed training jobs to thousands of accelerators
- Design fault tolerance strategies for training infrastructure
- Create interactive visualizations of model internals
Work Experience
- approx. 1 - 4 years
Education
- Bachelor's degree
Languages
- English – Business Fluent
Tools & Technologies
- Python
- deep learning frameworks
- ML systems
- language modeling
- ML Accelerators
- Kubernetes
Benefits
Competitive Pay
- Competitive compensation
Social Impact
- Optional equity donation matching
More Vacation Days
- Generous vacation
Generous Parental Leave
- Generous parental leave
Flexible Working
- Flexible working hours
Modern Office
- Collaborative office space
Like this job?
BetaYour Career Agent finds similar jobs for you every day.
Not a perfect match?
- AnthropicFull-timeWith HomeofficeExperiencedZürich
- Mistral
AI Scientist(m/w/x)
Full-timeWith HomeofficeNot specifiedZürich - ANYbotics
Senior AI Research Engineer, Visual Perception(m/w/x)
Full-timeWith HomeofficeSeniorZürich - Sony AI
Research Intern for Deep Generative Modeling(m/w/x)
Full-timeInternshipWith HomeofficeSchlieren - Snyk Switzerland AG
Senior Incubation Engineer(m/w/x)
Full-timeWith HomeofficeSeniorZürich
Research Engineer / Research Scientist, Pre-training(m/w/x)
Developing multimodal architectures and data processing optimizers for large-scale model training. Expertise in Python, deep learning, and ML accelerators required. Optional equity donation matching and generous parental leave.
Requirements
- Degree (BA required, MS or PhD preferred) in Computer Science, Machine Learning, or related field
- Strong software engineering skills; proven track record building complex systems
- Expertise in Python and deep learning frameworks
- Experience with high-performance, large-scale ML systems, language modeling
- Familiarity with ML Accelerators, Kubernetes, large-scale data processing
- Strong problem-solving skills and results-oriented mindset
- Excellent communication skills and collaborative work ability
- Significant software engineering experience
- Ability to balance research goals with practical engineering constraints
- Willingness to take on tasks outside job description
- Enjoyment of pair programming and collaborative work
- Eagerness to learn about machine learning research
- Enthusiasm for working in a cohesive team on large-scale AI research
- Ambitious goals for AI safety and long-term progress
- At least a Bachelor's degree in a related field or equivalent experience
Tasks
- Conduct research on model architecture, algorithms, data processing, and optimizers
- Implement solutions for model architecture, algorithms, data processing, and optimizers
- Lead small research projects independently
- Collaborate with team members on larger initiatives
- Design, run, and analyze scientific experiments
- Optimize and scale training infrastructure for efficiency and reliability
- Develop and improve dev tooling for team productivity
- Contribute to the entire stack, from optimization to model design
- Optimize throughput of novel attention mechanisms
- Propose Transformer variants
- Experimentally compare Transformer variant performance
- Prepare large-scale datasets for model consumption
- Scale distributed training jobs to thousands of accelerators
- Design fault tolerance strategies for training infrastructure
- Create interactive visualizations of model internals
Work Experience
- approx. 1 - 4 years
Education
- Bachelor's degree
Languages
- English – Business Fluent
Tools & Technologies
- Python
- deep learning frameworks
- ML systems
- language modeling
- ML Accelerators
- Kubernetes
Benefits
Competitive Pay
- Competitive compensation
Social Impact
- Optional equity donation matching
More Vacation Days
- Generous vacation
Generous Parental Leave
- Generous parental leave
Flexible Working
- Flexible working hours
Modern Office
- Collaborative office space
Like this job?
BetaYour Career Agent finds similar jobs for you every day.
About the Company
Anthropic
Industry
Science
Description
The company aims to create reliable, interpretable, and steerable AI systems for safe and beneficial use.
Not a perfect match?
- Anthropic
Research Engineer, Production Model Post Training(m/w/x)
Full-timeWith HomeofficeExperiencedZürich - Mistral
AI Scientist(m/w/x)
Full-timeWith HomeofficeNot specifiedZürich - ANYbotics
Senior AI Research Engineer, Visual Perception(m/w/x)
Full-timeWith HomeofficeSeniorZürich - Sony AI
Research Intern for Deep Generative Modeling(m/w/x)
Full-timeInternshipWith HomeofficeSchlieren - Snyk Switzerland AG
Senior Incubation Engineer(m/w/x)
Full-timeWith HomeofficeSeniorZürich