Researcher at nonprofit METR focusing on understanding AI capabilities and risks. Engaging in various projects related to AI assessment amid a collaborative research culture.
Responsibilities
We're seeking a researcher to help us better understand AI capabilities.
Previous work in this vein includes agent time horizons, a commonly used metric for measuring AI progress, and RCTs on open-source developer productivity.
Lead a project investigating transcripts as a source of evidence about agent capabilities.
Improve METR's time-horizon metric to make it more externally valid, more interpretable, and more predictive of threat-model-relevant capabilities.
Design and build experiments testing agent capabilities in the wild.
Lead large-scale human-subjects experiments measuring the impacts of AI agents on economically valuable R&D.
Requirements
You can write code. At the very least, you should be able to quickly write a data analysis script in Python to answer an important question. Bonus points if you can write a clean PR too.
You're excited to get your hands dirty. METR researchers often interact with LLMs in a wide variety of scenarios, read lots of agent transcripts, and closely review human outputs (e.g. video recordings of developers in our productivity RCT).
You are undaunted by open-ended mandates. You can take a confusing or ill-posed question and produce insightful and helpful frameworks/proposals/results.
You should be able to read, understand, and critique a research proposal. You're able to understand how particular projects fit into METR's overall mission.
You're a good written communicator. Bonus points if you can write a great paper.
Research Assistant providing patient care activities and supporting research projects at Cleveland Clinic. Collaborating with a team to ensure effective delivery of study-related tasks.
Research Scientist responsible for executing biology development programs and product expansion at AgroFresh. Collaboration with sales teams and maintaining communication with global R&D labs.
Research Assistant at Leibniz-Institut für Kristallzüchtung focusing on characterization of ferroelectric materials. Collaborating with international partners on X-ray diffraction and electrical measurements.
Associate Principal Scientist supporting outcomes research in Hematology to enable patient access to innovations. Collaborating with cross-functional teams to generate real-world evidence and improve patient health.
Associate Principal Scientist supporting in vitro pharmacology for early drug discovery in various locations across the U.S. Collaborating with CROs and overseeing scientific workflows in a hybrid work model.
Senior Principal Scientist for Cyber/EW capabilities at RTX, leading innovative cybersecurity and electromagnetic warfare projects in a defense context.
Principal Cyber Research Scientist at RTX conducting research on offensive and defensive cyberspace capabilities. Leading teams to develop solutions for U.S. defense and intelligence community needs.
Associate Research Scientist performing analytical testing and method optimization for biopharma products. Leading project teams and ensuring compliance with quality standards.
Graduate Research Assistant supporting innovative research at Foreign Policy Analytics. Collaborating with stakeholders and conducting analysis across various policy issues and sectors.
Research Assistant in Business Support Services conducting analytical support and research for corporate initiatives. Tracking data accuracy and participating in project management efforts to enhance service delivery.