Researcher at nonprofit METR focusing on understanding AI capabilities and risks. Engaging in various projects related to AI assessment amid a collaborative research culture.
Responsibilities
We're seeking a researcher to help us better understand AI capabilities.
Previous work in this vein includes agent time horizons, a commonly used metric for measuring AI progress, and RCTs on open-source developer productivity.
Lead a project investigating transcripts as a source of evidence about agent capabilities.
Improve METR's time-horizon metric to make it more externally valid, more interpretable, and more predictive of threat-model-relevant capabilities.
Design and build experiments testing agent capabilities in the wild.
Lead large-scale human-subjects experiments measuring the impacts of AI agents on economically valuable R&D.
Requirements
You can write code. At the very least, you should be able to quickly write a data analysis script in Python to answer an important question. Bonus points if you can write a clean PR too.
You're excited to get your hands dirty. METR researchers often interact with LLMs in a wide variety of scenarios, read lots of agent transcripts, and closely review human outputs (e.g. video recordings of developers in our productivity RCT).
You are undaunted by open-ended mandates. You can take a confusing or ill-posed question and produce insightful and helpful frameworks/proposals/results.
You should be able to read, understand, and critique a research proposal. You're able to understand how particular projects fit into METR's overall mission.
You're a good written communicator. Bonus points if you can write a great paper.
Research Scientist analyzing and interpreting healthcare data to support specialty value-based programs with Humana. Collaborating with clinical teams to derive actionable insights from complex datasets.
Principal Scientist leading biopharmaceutical downstream processing in a clinical and commercial environment. Managing a team and collaborating across divisions to support licenses and commercialization.
Associate Principal Scientist in Supply Analytical Sciences responsible for analytical method development and ensuring pharmaceutical product supply. Supporting commercial products in a fast-paced, multidisciplinary team environment.
Postdoctoral Fellow at the Usher Institute focusing on socio-legal aspects of AI. Engaging collaboratively in a multidisciplinary project exploring health and technology in society.
Laboratory Research Assistant conducting data collection for clinical device trials at UHS locations in New York. Responsible for maintaining data integrity and compliance with research protocols.
Senior Applied Scientist leading research and innovations in AI to make legal knowledge accessible at Robin. Bridging gaps between advanced search technologies and legal information needs in a hybrid work environment.
Research Assistant conducting research in pediatrics at Boston Medical Center. Involves patient recruitment, data management, and assisting investigators in preparations.
Research Scientist focusing on developing algorithms and productionizing World Models at Waabi. Join a startup pioneering autonomous transportation technology with a world-class team.
Research Scientist conducting applied research on healthcare data at Phare using AI technologies. Designing experiments and collaborating with MLOps to create measurable impact in the healthcare industry.
Postdoctoral Research Assistant leading analysis of unique datasets on homelessness and health. Join a multidisciplinary team at Melbourne Children’s Campus addressing adolescent health challenges.