The AI Job Search Engine
Data Scientist, Evals(m/w/x)
Architecting automated evaluation pipelines for search technologies, including VLM-based visual rendering and public benchmark adaptation. 4+ years data science/ML experience with AWS/Databricks proficiency required. Focus on cutting-edge AI/ML in advanced search technologies.
Requirements
- PhD, MS in technical field, or equivalent experience
- 4+ years data science or machine learning experience
- Strong proficiency in Python and SQL
- Experience with AWS and Databricks cloud stacks
- Comfort with agentic coding and AI-assisted development
- 1+ years experience with LLMs at scale
- Experience with customer-facing web products at scale
- Strong research background in real-world ML
- Experience defining evaluation metrics and datasets
Tasks
- Build specialized evaluations to improve answer quality
- Architect and maintain automated evaluation pipelines
- Design evaluation sets for tool calls and web search retrieval
- Develop VLM-based solutions for visual rendering evaluation
- Review and adapt public benchmarks for product performance
- Collaborate with technical leadership to shape product changes
- Measure and improve overall answer quality metrics
Work Experience
- 4 years
Education
- Master's degree
Languages
- English – Business Fluent
Tools & Technologies
- Python
- SQL
- AWS
- Databricks
- LLMs
- LLM-as-a-judge
Not a perfect match?
- PerplexityFull-timeOn-siteExperiencedBerlin
- SumUp
Data Scientist(m/w/x)
Full-timeOn-siteExperiencedBerlin - Perplexity
Software Engineer - Data Flywheel(m/w/x)
Full-timeOn-siteExperiencedBerlin - Prior Labs
Data Scientist(m/w/x)
Full-timeOn-siteExperiencedFreiburg im Breisgau, Berlin - Cresta
Senior Machine Learning Engineer(m/w/x)
Full-timeOn-siteSeniorBerlin
Data Scientist, Evals(m/w/x)
Architecting automated evaluation pipelines for search technologies, including VLM-based visual rendering and public benchmark adaptation. 4+ years data science/ML experience with AWS/Databricks proficiency required. Focus on cutting-edge AI/ML in advanced search technologies.
Requirements
- PhD, MS in technical field, or equivalent experience
- 4+ years data science or machine learning experience
- Strong proficiency in Python and SQL
- Experience with AWS and Databricks cloud stacks
- Comfort with agentic coding and AI-assisted development
- 1+ years experience with LLMs at scale
- Experience with customer-facing web products at scale
- Strong research background in real-world ML
- Experience defining evaluation metrics and datasets
Tasks
- Build specialized evaluations to improve answer quality
- Architect and maintain automated evaluation pipelines
- Design evaluation sets for tool calls and web search retrieval
- Develop VLM-based solutions for visual rendering evaluation
- Review and adapt public benchmarks for product performance
- Collaborate with technical leadership to shape product changes
- Measure and improve overall answer quality metrics
Work Experience
- 4 years
Education
- Master's degree
Languages
- English – Business Fluent
Tools & Technologies
- Python
- SQL
- AWS
- Databricks
- LLMs
- LLM-as-a-judge
About the Company
Perplexity
Industry
IT
Description
The company focuses on building advanced search technologies, emphasizing retrieval and ranking.
Not a perfect match?
- Perplexity
Data Scientist/Engineer – Online Metrics(m/w/x)
Full-timeOn-siteExperiencedBerlin - SumUp
Data Scientist(m/w/x)
Full-timeOn-siteExperiencedBerlin - Perplexity
Software Engineer - Data Flywheel(m/w/x)
Full-timeOn-siteExperiencedBerlin - Prior Labs
Data Scientist(m/w/x)
Full-timeOn-siteExperiencedFreiburg im Breisgau, Berlin - Cresta
Senior Machine Learning Engineer(m/w/x)
Full-timeOn-siteSeniorBerlin