The AI Job Search Engine
Research Engineer / Research Scientist, Pre-training(m/w/x)
Description
You will advance the next generation of multimodal LLMs by blending cutting-edge research with engineering, from scaling massive training jobs to designing innovative model architectures.
Let AI find the perfect jobs for you!
Upload your CV and Nejo AI will find matching job offers for you.
Requirements
- •Degree in Computer Science, Machine Learning, or related field
- •Strong software engineering skills
- •Expertise in Python and deep learning frameworks
- •Experience with high-performance, large-scale ML systems
- •Familiarity with ML Accelerators, Kubernetes, and data processing
- •Strong problem-solving skills and results-oriented mindset
- •Excellent communication and collaboration skills
- •Significant software engineering experience
- •Ability to balance research and engineering constraints
- •Willingness to support team with diverse tasks
- •Enjoyment of pair programming and collaborative work
- •Eagerness to learn machine learning research
- •Enthusiasm for large-scale AI research projects
- •Ambitious goals for AI safety and progress
- •Bachelor's degree in related field or equivalent experience
Education
Work Experience
approx. 4 - 6 years
Tasks
- •Conduct research on model architectures and algorithms
- •Implement solutions for data processing and optimizers
- •Lead independent research projects and team initiatives
- •Design and analyze scientific experiments for LLMs
- •Optimize and scale training infrastructure for reliability
- •Develop dev tooling to enhance team productivity
- •Contribute to low-level optimizations and model design
- •Optimize throughput for novel attention mechanisms
- •Propose and experimentally compare Transformer variants
- •Prepare large-scale datasets for model consumption
- •Scale distributed training to thousands of accelerators
- •Design fault tolerance strategies for training systems
- •Create interactive visualizations of model internals
Tools & Technologies
Languages
English – Business Fluent
Benefits
Flexible Working
- •Flexible working hours
Social Impact
- •Optional equity donation matching
More Vacation Days
- •Generous vacation
Generous Parental Leave
- •Generous parental leave
Modern Office
- •Collaborative office space
- AnthropicFull-timeWith HomeofficeExperiencedZürich
- Anthropic
Research Engineer, Production Model Post Training(m/w/x)
Full-timeWith HomeofficeExperiencedZürich - Mistral
AI Scientist(m/w/x)
Full-timeWith HomeofficeNot specifiedZürich - ANYbotics
Senior AI Research Engineer, Visual Perception(m/w/x)
Full-timeWith HomeofficeSeniorZürich - Lakera
Senior AI Engineer(m/w/x)
Full-timeRemoteSeniorZürich
Research Engineer / Research Scientist, Pre-training(m/w/x)
The AI Job Search Engine
Description
You will advance the next generation of multimodal LLMs by blending cutting-edge research with engineering, from scaling massive training jobs to designing innovative model architectures.
Let AI find the perfect jobs for you!
Upload your CV and Nejo AI will find matching job offers for you.
Requirements
- •Degree in Computer Science, Machine Learning, or related field
- •Strong software engineering skills
- •Expertise in Python and deep learning frameworks
- •Experience with high-performance, large-scale ML systems
- •Familiarity with ML Accelerators, Kubernetes, and data processing
- •Strong problem-solving skills and results-oriented mindset
- •Excellent communication and collaboration skills
- •Significant software engineering experience
- •Ability to balance research and engineering constraints
- •Willingness to support team with diverse tasks
- •Enjoyment of pair programming and collaborative work
- •Eagerness to learn machine learning research
- •Enthusiasm for large-scale AI research projects
- •Ambitious goals for AI safety and progress
- •Bachelor's degree in related field or equivalent experience
Education
Work Experience
approx. 4 - 6 years
Tasks
- •Conduct research on model architectures and algorithms
- •Implement solutions for data processing and optimizers
- •Lead independent research projects and team initiatives
- •Design and analyze scientific experiments for LLMs
- •Optimize and scale training infrastructure for reliability
- •Develop dev tooling to enhance team productivity
- •Contribute to low-level optimizations and model design
- •Optimize throughput for novel attention mechanisms
- •Propose and experimentally compare Transformer variants
- •Prepare large-scale datasets for model consumption
- •Scale distributed training to thousands of accelerators
- •Design fault tolerance strategies for training systems
- •Create interactive visualizations of model internals
Tools & Technologies
Languages
English – Business Fluent
Benefits
Flexible Working
- •Flexible working hours
Social Impact
- •Optional equity donation matching
More Vacation Days
- •Generous vacation
Generous Parental Leave
- •Generous parental leave
Modern Office
- •Collaborative office space
About the Company
Anthropic
Industry
Research
Description
The company aims to create reliable, interpretable, and steerable AI systems for safe and beneficial use.
- Anthropic
Research Engineer / Research Scientist, Pretraining(m/w/x)
Full-timeWith HomeofficeExperiencedZürich - Anthropic
Research Engineer, Production Model Post Training(m/w/x)
Full-timeWith HomeofficeExperiencedZürich - Mistral
AI Scientist(m/w/x)
Full-timeWith HomeofficeNot specifiedZürich - ANYbotics
Senior AI Research Engineer, Visual Perception(m/w/x)
Full-timeWith HomeofficeSeniorZürich - Lakera
Senior AI Engineer(m/w/x)
Full-timeRemoteSeniorZürich