Senior Research Scientist creating evaluation methods and benchmarks for LLMs at Cohere. Working with cross-functional teams to advance AI capabilities and model performance evaluation.
Responsibilities
Create ambitious new evaluation benchmarks that push the limits of what our models can accomplish.
Work on highly cross-functional teams to translate model feedback into trustworthy, repeatable evaluations.
Conduct research to advance the state-of-the-art in LLM evaluation methods, including training LLM judges; refining LLM-based data synthesis pipelines; and improving evaluation efficiency.
Build scalable and reusable tools for digging into model performance.
Requirements
You enjoy rapidly building prototypes that demonstrate the boundaries of what LLMs are capable of, and you have developed resources to measure those capabilities.
You have spent dozens of hours reviewing complex data and LLM outputs to ensure high data quality.
You are obsessive about rigorously measuring AI capabilities, and also about making sure your measurements actually align with the capabilities you care about.
You have strong software engineering skills.
Benefits
An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for 6 months for employees based in Canada, the US, and the UK
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
Postdoctoral position working in a collaborative project to develop a new treatment for colorectal cancer using organoid models. Aimed at bridging academic and industry - related research in health.
Principal Scientist leading outcomes research in Hematology at a global health care firm. Focus on generating real - world evidence for innovative products and healthcare outcomes.
Postdoctoral Research Fellow studying molecular/epigenetic epidemiology at the University of Edinburgh. Engaging in research on biomarkers linked to organ ageing and disease outcomes, contributing to projects and collaborations.
Wissenschaftlicher Mitarbeiter in Mikrobiologie bei MVZ diagnosticum GmbH. Verantwortlich für Projektleitung und fachliche Anleitung im medizinischen Labor.
Associate Director for Data Strategy contributing to real - world evidence with advanced analytics and partnerships. Driving innovative projects in healthcare decision - making and data science in a collaborative environment.
Principal Scientist leading CDx/IVD development projects in Translational Oncology within a leading health care company. Collaborating with teams on diagnostics and biomarker strategies.
Research Assistant in Electrolysis at Fraunhofer Institute specializing in hydrogen technology development. Engaging in cutting - edge research and innovative projects in electrochemical processes.
Principal Scientist responsible for drug discovery activities within Small Molecule Research at Novo Nordisk. Leading assay development, managing CROs, and collaborating across teams globally.
Lead Applied Research Scientist at ILLUIN Technology transforming state - of - the - art AI into functional solutions. Combining academic excellence and startup agility in a hybrid role.
Wissenschaftlicher Mitarbeiter responsible for microbiology projects at MVZ diagnosticum GmbH. Leading teams and organizing workflows within medical laboratory environment in Neukirchen.