Skip to content
Change the repository type filter

All

    Repositories list

    • vodle

      Public
      We develop an interactive, consensus-oriented group decision app
      JavaScript
      GNU Affero General Public License v3.0
      1726516Updated Oct 14, 2024Oct 14, 2024
    • satisfia

      Public
      Satisficing-based Intelligent Agents
      Python
      GNU Affero General Public License v3.0
      2475Updated Oct 3, 2024Oct 3, 2024
    • A repo to explore multi-agent reinforcement learning in the context of aspiration based, non-maximising agents. This project is part of the Supervised Program for Alignment Research.
      Jupyter Notebook
      0000Updated Aug 25, 2024Aug 25, 2024
    • Jupyter Notebook
      MIT License
      4000Updated Aug 8, 2024Aug 8, 2024
    • Webppl library for generating Gridworld MDPs. JS library for displaying Gridworld. Additional agents that satisfice.
      JavaScript
      4202Updated Mar 7, 2024Mar 7, 2024
    • High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
      Python
      Other
      673000Updated Dec 11, 2023Dec 11, 2023
    • Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
      Python
      MIT License
      177100Updated Oct 15, 2023Oct 15, 2023
    • Jupyter Notebook
      1000Updated Oct 4, 2023Oct 4, 2023
    • A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
      Python
      Apache License 2.0
      59000Updated Aug 25, 2023Aug 25, 2023
    • This is a suite of reinforcement learning environments illustrating various safety properties of intelligent agents.
      Python
      Apache License 2.0
      121000Updated Jul 17, 2023Jul 17, 2023
    • Simple and easily configurable grid world environments for reinforcement learning
      Python
      Other
      612000Updated Jul 13, 2023Jul 13, 2023
    • Markov Decision Process (MDP) Toolbox for Python
      Jupyter Notebook
      BSD 3-Clause "New" or "Revised" License
      252000Updated Jun 30, 2023Jun 30, 2023
    • A validated automatic evaluator for instruction-following language models. High-quality, cheap, and fast.
      Jupyter Notebook
      Apache License 2.0
      246000Updated Jun 30, 2023Jun 30, 2023
    • motabarnn

      Public
      python package for torch-based neural network version of MoTaBaR
      Python
      Apache License 2.0
      0006Updated Jun 21, 2023Jun 21, 2023
    • PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
      Python
      MIT License
      1.7k000Updated Jun 13, 2023Jun 13, 2023
    • A modular RL library to fine-tune language models to human preferences
      Python
      Apache License 2.0
      191000Updated May 31, 2023May 31, 2023
    • pyoptes

      Public
      Python framework for optimization of epidemic testing strategies
      Python
      BSD 2-Clause "Simplified" License
      0100Updated Apr 12, 2023Apr 12, 2023
    • Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
      Python
      24000Updated Mar 13, 2023Mar 13, 2023
    • Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
      Python
      MIT License
      454000Updated Feb 3, 2023Feb 3, 2023
    • C++
      MIT License
      16000Updated Jan 21, 2023Jan 21, 2023
    • tricl

      Public
      TriCl model in C++
      C++
      GNU General Public License v3.0
      0020Updated Sep 23, 2022Sep 23, 2022
    • Python
      Apache License 2.0
      121000Updated Jan 11, 2022Jan 11, 2022
    • quantify agents' degrees of moral responsibility in complex multi-agent decision situations
      Python
      BSD 2-Clause "Simplified" License
      0000Updated Dec 15, 2021Dec 15, 2021
    • Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments
      Python
      Apache License 2.0
      2000Updated Jun 3, 2021Jun 3, 2021
    • Reinforcement Learning through Active Inference with additional safety measures
      Python
      22000Updated Apr 27, 2020Apr 27, 2020
    • Modeling agents with probabilistic programs
      TeX
      17000Updated Sep 4, 2019Sep 4, 2019