Working Student – GenAI / LLM Evaluation – Agentic AI / NLP (f/m/d)
Job details
Company
Cinemo GmbH
Location
Karlsruhe, Germany
Employment type
Company
Cinemo GmbH
Location
Karlsruhe, Germany
Employment type
Cinemo GmbHKarlsruhe, Germany
Cinemo GmbHKarlsruhe, Germany
Working Student
Seniority
Intern
Primary category
Software Development
Secondary category
NLP
Posted date
6 Feb 2026
Valid through
7 Apr 2026
Support evaluation of agentic AI systems and LLM-based NLP features, including qualitative and quantitative analysis.
Create, curate, and maintain datasets for benchmarking, regression testing, and scenario coverage.
Extend and improve internal evaluation frameworks (metrics, dashboards, automated test runs).
Contribute to end-to-end testing of GenAI features within the in-car experience, including integration and validation workflows.
Document findings, track model/system changes, and communicate results clearly to the team.
Collaborate with engineers and researchers to translate evaluation insights into actionable improvements.
Ongoing Bachelor’s or Master’s studies in Computer Science, AI/ML, Data Science, Computational Linguistics, or a related field.
Hands-on programming skills in Python and a solid understanding of basic ML/NLP concepts.
Interest in GenAI / LLMs, agentic systems, and evaluation of non-deterministic AI behavior.
Experience with data handling and dataset creation (labeling, preprocessing, quality checks).
Familiarity with software testing concepts (e.g., unit/e2e testing, CI) is a plus.
Good written and spoken English communication skills.
The successful candidate will be based in Karlsruhe, Germany.
Cinemo GmbHKarlsruhe, Germany
Cinemo GmbHHybrid, Germany
ValeoBremen-Hemelingen; Wunstorf, Germany
N8nRemote, Germany