Working Student – GenAI / LLM Evaluation – Agentic AI / NLP (m/w/x)
Evaluate agentic AI systems and LLM-based NLP features, and curate datasets for benchmarking and regression testing. Python programming skills and an understanding of basic ML/NLP concepts are required.
Requirements
- Bachelor's or Master's studies in Computer Science, AI/ML, Data Science, Computational Linguistics, or related field
- Hands-on programming skills in Python
- Solid understanding of basic ML/NLP concepts
- Interest in GenAI / LLMs
- Interest in agentic systems
- Interest in evaluation of non-deterministic AI behavior
- Experience with data handling
- Experience with dataset creation (labeling, preprocessing, quality checks)
- Familiarity with software testing concepts (e.g., unit/e2e testing, CI)
- Good written English communication skills
- Good spoken English communication skills
Tasks
- Support evaluation of agentic AI systems
- Evaluate LLM-based NLP features
- Conduct qualitative and quantitative analysis
- Create and curate datasets for benchmarking
- Maintain datasets for regression testing
- Ensure scenario coverage in datasets
- Extend internal evaluation frameworks
- Improve metrics and dashboards
- Automate test runs
- Contribute to end-to-end testing
- Test GenAI features in in-car experiences
- Integrate and validate workflows
- Document evaluation findings
- Track model and system changes
- Communicate results to the team
- Collaborate with engineers and researchers
- Translate insights into actionable improvements
Education
- Currently in higher education
Languages
- English – Business Fluent
Tools & Technologies
- Python
- GenAI
- LLMs
- AI/ML
- Data Science
- Computational Linguistics
- NLP
About the Company
Cinemo
Industry
IT
Description
The company builds and maintains shared internal developer platforms and self-service tooling to enable engineering teams to move quickly.