Researcher at nonprofit METR focusing on understanding AI capabilities and risks. Engaging in various projects related to AI assessment amid a collaborative research culture.
Responsibilities
We're seeking a researcher to help us better understand AI capabilities.
Previous work in this vein includes agent time horizons, a commonly used metric for measuring AI progress, and RCTs on open-source developer productivity.
Lead a project investigating transcripts as a source of evidence about agent capabilities.
Improve METR's time-horizon metric to make it more externally valid, more interpretable, and more predictive of threat-model-relevant capabilities.
Design and build experiments testing agent capabilities in the wild.
Lead large-scale human-subjects experiments measuring the impacts of AI agents on economically-valuable R&D.
Requirements
You can write code. At the very least, you should be able to quickly write a data analysis script in Python to answer an important question. Bonus points if you can write a clean PR too.
You're excited to get your hands dirty. METR researchers often interact with LLMs in a wide variety of scenarios, read lots of agent transcripts, and closely review human outputs (e.g. video recordings of developers in our productivity RCT).
You are undaunted by open-ended mandates. You can take a confusing or ill-posed question and produce insightful and helpful frameworks/proposals/results.
You should be able to read, understand, and critique a research proposal. You're able to understand how particular projects fit into METR's overall mission.
You're a good written communicator. Bonus points if you can write a great paper.
Research Scientist at Valence Labs developing ML models for predicting cellular responses in drug discovery. Building generative models based on massive multiomics datasets with collaborative research.
Research Assistant responsible for statistical data analysis under a Principal Investigator for health science projects. Involves data management, statistical analyses, and documentation of workflows.
Temporary Research Assistant supporting data collection for Medical Ethics at University of Pennsylvania. Engaging in programming activities and organizing research-related documentation.
Machine Learning Research Scientist conducting applied AI/ML research at SEI. Developing prototype capabilities for government workflows with a focus on mission context.
Summer Research Assistant helping with research and field work in agroforestry systems. Collecting data and assisting in various tasks based in Wisconsin.
R&D Scientist at IFF developing innovative flavor and fragrance delivery technologies. Conducting research and experiments to enhance consumer experiences in everyday products.
Postdoctoral Research Fellow involved in legal research for sustainable textile value chains. Collaborating with researchers on a funded project for law and circular economy.
Research Assistant at HHL Leipzig focusing on psychological and leadership studies. Supporting research projects and students while pursuing a doctorate in a dynamic academic environment.
Research Scientist RN at Providence assuming responsibilities in leading research studies and innovative educational programs. Focusing on quality improvement and dissemination of study results in healthcare.