Senior Research Scientist leading research efforts on reward models for AI. Shaping how models understand and optimize for human preferences with a focus on AI safety and capability.
Responsibilities
Lead research on novel reward model architectures and training approaches for RLHF
Develop and evaluate LLM-based grading and evaluation methods, including rubric-driven approaches that improve consistency and interpretability
Research techniques to detect, characterize, and mitigate reward hacking and specification gaming
Design experiments to understand reward model generalization, robustness, and failure modes
Collaborate with the Finetuning team to translate research insights into improvements for production training pipelines
Contribute to research publications, blog posts, and internal documentation
Mentor other researchers and help build institutional knowledge around reward modeling
Requirements
A track record of research contributions in reward modeling, RLHF, or closely related areas of machine learning
Experience training and evaluating reward models for large language models
Comfortable designing and running large-scale experiments with significant computational resources
Work effectively across research and engineering, iterating quickly while maintaining scientific rigor
Enjoy collaborative research and can communicate complex ideas clearly to diverse audiences
Care deeply about building AI systems that are both highly capable and safe.
Strong candidates may also have published research on reward modeling, preference learning, or RLHF
Experience with LLM-as-judge approaches including calibration and reliability challenges
Worked on reward hacking, specification gaming, or related robustness problems
Experience with constitutional AI, debate, or other scalable oversight approaches
Contributed to production ML systems at scale
Familiarity with interpretability techniques as applied to understanding reward model behavior.
Research Assistant conducting experiments and supporting innovation at Meissner in scientific research. Collaborating with R&D teams and maintaining meticulous records of experimental data.
Senior Principal Scientist focusing on cardiovascular translational development and late - stage clinical research at Bristol Myers Squibb. Integrates laboratory science and project management to maximize drug development potential.
Senior Research Scientist leading AI initiatives at GEICO, focusing on complex ML and GenAI solutions while mentoring teams and delivering business impact.
Referendar und wissenschaftlicher Mitarbeiter at law firm Bird & Bird in Frankfurt, focusing on high - tech legal sectors. Engaging in direct case work and providing exam preparation support within a collegial environment.
Referendar and wissenschaftlicher Mitarbeiter role at Bird & Bird in Düsseldorf. Focus on corporate law and technology sectors with flexible working arrangements.
Research Assistant supporting a statewide housing research initiative focused on barriers to production. Engaging in qualitative research, analysis, and stakeholder coordination from Chicago or remote in Illinois.
Research Assistant supporting equity - focused urban planning and public policy initiatives at MPC. Conducting research, analysis, and project support for policy areas including housing and transportation.
Research Assistant contributing to Metropolitan Planning Council’s water policy initiatives through research and analytical support. Involves engagement in equity - focused urban planning and public policy.
Research Assistant contributing to transportation policy initiatives focused on sustainable development at Metropolitan Planning Council. Engaging in research, analysis, and stakeholder support within a collaborative team.
Principal Scientist leading DMPK strategy for drug discovery programs at Novo Nordisk. Engaging in cross - functional collaboration and managing research initiatives for therapeutic areas.