Skip to content
@AlignmentResearch

FAR.AI

Frontier alignment research to ensure the safe development and deployment of advanced AI systems.

Popular repositories Loading

  1. tuned-lens tuned-lens Public

    Tools for understanding how transformer predictions are built layer-by-layer

    Python 429 47

  2. go_attack go_attack Public

    Python 80 7

  3. vlmrm vlmrm Public

    Python 41 12

  4. gpt-4-novel-apis-attacks gpt-4-novel-apis-attacks Public

    17 1

  5. learned-planner learned-planner Public

    Interpretability tools for recurrent networks that play Sokoban

    Python 7 2

  6. KataGo-custom KataGo-custom Public

    Child repository of https://github.com/HumanCompatibleAI/go_attack.

    C++ 4 1

Repositories

Showing 10 of 29 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…