Skip to content
@METR

METR

Model Evaluation and Threat Research

Model Evaluation and Threat Research (METR)

METR is a research nonprofit that works on assessing whether cutting-edge AI systems could pose catastrophic risks to society.

We build the science of accurately assessing risks, so that humanity is informed before developing transformative AI systems.

Read more about our work here.

Our Software

Popular repositories Loading

  1. task-standard task-standard Public

    METR Task Standard

    TypeScript 129 31

  2. vivaria vivaria Public

    Vivaria is METR's tool for running evaluations and conducting agent elicitation research.

    TypeScript 71 20

  3. public-tasks public-tasks Public

    TeX 69 3

  4. ai-rd-tasks ai-rd-tasks Public

    Python 38 4

  5. task-template task-template Public template

    TypeScript 9 5

  6. autonomy-evals-guide autonomy-evals-guide Public

    SCSS 3 4

Repositories

Showing 10 of 16 repositories
  • vivaria Public

    Vivaria is METR's tool for running evaluations and conducting agent elicitation research.

    METR/vivaria’s past year of commit activity
    TypeScript 71 MIT 20 229 (3 issues need help) 20 Updated Dec 15, 2024
  • public-tasks Public
    METR/public-tasks’s past year of commit activity
    TeX 69 3 0 1 Updated Dec 14, 2024
  • ai-rd-tasks Public
    METR/ai-rd-tasks’s past year of commit activity
    Python 38 4 1 1 Updated Dec 13, 2024
  • METR/task-protected-scoring’s past year of commit activity
    Python 0 1 3 2 Updated Dec 13, 2024
  • task-assets Public
    METR/task-assets’s past year of commit activity
    Python 0 0 1 1 Updated Dec 11, 2024
  • viv-task-dev Public
    METR/viv-task-dev’s past year of commit activity
    Shell 0 1 6 0 Updated Dec 10, 2024
  • .github Public
    METR/.github’s past year of commit activity
    0 0 0 0 Updated Nov 24, 2024
  • nanoGPT Public Forked from karpathy/nanoGPT

    The simplest, fastest repository for training/finetuning medium-sized GPTs.

    METR/nanoGPT’s past year of commit activity
    Python 0 MIT 6,143 0 0 Updated Nov 22, 2024
  • METR/autonomy-evals-guide’s past year of commit activity
    SCSS 3 MIT 4 0 1 Updated Nov 21, 2024
  • METR/task-legacy-verifier’s past year of commit activity
    Python 0 0 0 0 Updated Nov 8, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…