Hybrid Werkstudent – AI Research, Data Evaluation

Posted last week

Apply now

About the role

  • As a Research Student at LILT, you'll evaluate AI models for multilingual tasks and work with leading global labs. Opportunity for publishing and innovative research in AI.

Responsibilities

  • Evaluate & Benchmark: Run rigorous evaluations on frontier LLMs and autonomous agents across diverse tasks.
  • Data Engineering: Create or modify benchmark data to test the reasoning and linguistic limits of modern AI.
  • Experimental Research: Design and run experiments to identify "model-breaking points" and interpret the resulting data.

Requirements

  • Currently enrolled at TU Berlin majoring in Computer Science (Bachelor/Master) or a related field
  • Solid understanding of LLMs, natural language processing, or machine learning
  • Highly proficient in Python, Bash, and git
  • Appetite to quickly understand and incorporate new methodologies and models in a rapidly changing research landscape
  • Strong drive to ship customer projects, sometimes on tight deadlines, to high quality
  • Proficient in English
  • Preferred: Proficient in one or more non-English languages

Benefits

  • Work directly with models and teams from frontier labs like Google or Anthropic
  • Opportunity to publish papers in top-tier AI/ML conferences
  • Contribute to industry-standard open-source benchmarks
  • Competitive salary
  • Hybrid environment with an on-site research team

Job title

Werkstudent – AI Research, Data Evaluation

Job type

Experience level

Entry level

Salary

Not specified

Degree requirement

Bachelor's Degree

Tech skills

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job