Senior Research Scientist, Reward Models at Anthropic | Hybrid Hired

About the role

Senior Research Scientist leading research efforts on reward models for AI. Shaping how models understand and optimize for human preferences with a focus on AI safety and capability.

Responsibilities

Lead research on novel reward model architectures and training approaches for RLHF
Develop and evaluate LLM-based grading and evaluation methods, including rubric-driven approaches that improve consistency and interpretability
Research techniques to detect, characterize, and mitigate reward hacking and specification gaming
Design experiments to understand reward model generalization, robustness, and failure modes
Collaborate with the Finetuning team to translate research insights into improvements for production training pipelines
Contribute to research publications, blog posts, and internal documentation
Mentor other researchers and help build institutional knowledge around reward modeling

Requirements

A track record of research contributions in reward modeling, RLHF, or closely related areas of machine learning
Experience training and evaluating reward models for large language models
Comfortable designing and running large-scale experiments with significant computational resources
Work effectively across research and engineering, iterating quickly while maintaining scientific rigor
Enjoy collaborative research and can communicate complex ideas clearly to diverse audiences
Care deeply about building AI systems that are both highly capable and safe.
Strong candidates may also have published research on reward modeling, preference learning, or RLHF
Experience with LLM-as-judge approaches including calibration and reliability challenges
Worked on reward hacking, specification gaming, or related robustness problems
Experience with constitutional AI, debate, or other scalable oversight approaches
Contributed to production ML systems at scale
Familiarity with interpretability techniques as applied to understanding reward model behavior.

Benefits

Competitive compensation and benefits
Optional equity donation matching
Generous vacation and parental leave
Flexible working hours

Similar roles

Browse all Research Scientist jobs

16 hours ago

ME

Research Assistant

Meissner

Research Assistant conducting experiments and supporting innovation at Meissner in scientific research. Collaborating with R&D teams and maintaining meticulous records of experimental data.

Onsite Role

Camarillo United States Research Scientist

$24 - $31 per hour

yesterday

BS

Senior Principal Scientist, Cardiovascular, Translational Development

Bristol Myers Squibb

Senior Principal Scientist focusing on cardiovascular translational development and late - stage clinical research at Bristol Myers Squibb. Integrates laboratory science and project management to maximize drug development potential.

Hybrid Role

Princeton United States Research Scientist

$184,060 - $223,036 per year

2 days ago

GE

Applied Research Scientist, II

GEICO

Senior Research Scientist leading AI initiatives at GEICO, focusing on complex ML and GenAI solutions while mentoring teams and delivering business impact.

Hybrid Role

New York City United States Research Scientist

$105,000 - $215,000 per year

3 days ago

BB

Referendar, Wissenschaftlicher Mitarbeiter – verschiedene Bereiche

Bird & Bird

Referendar und wissenschaftlicher Mitarbeiter at law firm Bird & Bird in Frankfurt, focusing on high - tech legal sectors. Engaging in direct case work and providing exam preparation support within a collegial environment.

Hybrid Role

Frankfurt am Main Germany Research Scientist

3 days ago

BB

Referendar, Wissenschaftlicher Mitarbeiter – Verschiedene Bereiche

Bird & Bird

Referendar and wissenschaftlicher Mitarbeiter role at Bird & Bird in Düsseldorf. Focus on corporate law and technology sectors with flexible working arrangements.

Hybrid Role

Düsseldorf Germany Research Scientist

4 days ago

MC

Research Assistant – Housing

Metropolitan Planning Council

Research Assistant supporting a statewide housing research initiative focused on barriers to production. Engaging in qualitative research, analysis, and stakeholder coordination from Chicago or remote in Illinois.

Hybrid Role

Chicago United States Research Scientist

$18 per hour

4 days ago

MC

Research Assistant

Metropolitan Planning Council

Research Assistant supporting equity - focused urban planning and public policy initiatives at MPC. Conducting research, analysis, and project support for policy areas including housing and transportation.

Hybrid Role

Chicago United States Research Scientist

$18 per hour

4 days ago

MC

Research Assistant, Water

Metropolitan Planning Council

Research Assistant contributing to Metropolitan Planning Council’s water policy initiatives through research and analytical support. Involves engagement in equity - focused urban planning and public policy.

Hybrid Role

Chicago United States Research Scientist

$18 per hour

4 days ago

MC

Research Assistant – Transportation

Metropolitan Planning Council

Research Assistant contributing to transportation policy initiatives focused on sustainable development at Metropolitan Planning Council. Engaging in research, analysis, and stakeholder support within a collaborative team.

Hybrid Role

Chicago United States Research Scientist

$18 per hour

4 days ago

NN

Principal Scientist, DMPK Project Representative

Novo Nordisk

Principal Scientist leading DMPK strategy for drug discovery programs at Novo Nordisk. Engaging in cross - functional collaboration and managing research initiatives for therapeutic areas.

Onsite Role

Lexington United States Research Scientist

$148,290 - $259,510 per year