PIK GaNe

26 repositories

vodle
Public
We develop an interactive, consensus-oriented group decision app
svg couchdb app ionic decision-making voting delegation democracy consensus budgeting
JavaScript • GNU Affero General Public License v3.0 • 17 • 26 • 51 • 6 • Updated Oct 14, 2024
satisfia
Public
Satisficing-based Intelligent Agents
Python • GNU Affero General Public License v3.0 • 2 • 4 • 7 • 5 • Updated Oct 3, 2024
satisfia-marl
Public
A repo to explore multi-agent reinforcement learning in the context of aspiration-based, non-maximising agents. This project is part of the Supervised Program for Alignment Research.
Jupyter Notebook • 0 • 0 • 0 • 0 • Updated Aug 25, 2024
pref_voting
Public
Jupyter Notebook • MIT License • 4 • 0 • 0 • 0 • Updated Aug 8, 2024
webppl-agents-satisfia
Public
WebPPL library for generating Gridworld MDPs. JS library for displaying Gridworld. Additional agents that satisfice.
JavaScript • 4 • 2 • 0 • 2 • Updated Mar 7, 2024
cleanrl-satisfia
Public
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Python • Other • 673 • 0 • 0 • 0 • Updated Dec 11, 2023
stable-baselines3-contrib-satisfia
Public
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
Python • MIT License • 177 • 1 • 0 • 0 • Updated Oct 15, 2023
rl.pyro-satisfia
Public
Jupyter Notebook • 1 • 0 • 0 • 0 • Updated Oct 4, 2023
alpaca_farm-collective
Public
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
Python • Apache License 2.0 • 59 • 0 • 0 • 0 • Updated Aug 25, 2023
ai-safety-gridworlds-satisfia
Public
This is a suite of reinforcement learning environments illustrating various safety properties of intelligent agents.
Python • Apache License 2.0 • 121 • 0 • 0 • 0 • Updated Jul 17, 2023
Minigrid-satisfia
Public
Simple and easily configurable grid world environments for reinforcement learning
Python • Other • 612 • 0 • 0 • 0 • Updated Jul 13, 2023
pymdptoolbox-satisfia
Public
Markov Decision Process (MDP) Toolbox for Python
Jupyter Notebook • BSD 3-Clause "New" or "Revised" License • 252 • 0 • 0 • 0 • Updated Jun 30, 2023
alpaca_eval-collective
Public
A validated automatic evaluator for instruction-following language models. High-quality, cheap, and fast.
Jupyter Notebook • Apache License 2.0 • 246 • 0 • 0 • 0 • Updated Jun 30, 2023
motabarnn
Public
Python package for a torch-based neural network version of MoTaBaR
Python • Apache License 2.0 • 0 • 0 • 0 • 6 • Updated Jun 21, 2023
stable-baselines3-satisfia
Public
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Python • MIT License • 1.7k • 0 • 0 • 0 • Updated Jun 13, 2023
RL4LMs_RLCHF
Public
A modular RL library to fine-tune language models to human preferences
Python • Apache License 2.0 • 191 • 0 • 0 • 0 • Updated May 31, 2023
pyoptes
Public
Python framework for optimization of epidemic testing strategies
Python • BSD 2-Clause "Simplified" License • 0 • 1 • 0 • 0 • Updated Apr 12, 2023
train-procgen-pytorch-satisfia
Public
PyTorch implementation of OpenAI's Procgen PPO baseline, built from scratch.
Python • 24 • 0 • 0 • 0 • Updated Mar 13, 2023
decision-transformer-satisfia
Public
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Python • MIT License • 454 • 0 • 0 • 0 • Updated Feb 3, 2023
procgenAISC-satisfia
Public
C++ • MIT License • 16 • 0 • 0 • 0 • Updated Jan 21, 2023
tricl
Public
TriCl model in C++
C++ • GNU General Public License v3.0 • 0 • 0 • 2 • 0 • Updated Sep 23, 2022
attainable-utility-preservation-satisfia
Public
Python • Apache License 2.0 • 121 • 0 • 0 • 0 • Updated Jan 11, 2022
pyresponsibility
Public
Quantify agents' degrees of moral responsibility in complex multi-agent decision situations
Python • BSD 2-Clause "Simplified" License • 0 • 0 • 0 • 0 • Updated Dec 15, 2021
avoiding-side-effects-satisfia
Public
Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments
Python • Apache License 2.0 • 2 • 0 • 0 • 0 • Updated Jun 3, 2021
rl-inference-satisfia
Public
Reinforcement Learning through Active Inference with additional safety measures
Python • 22 • 0 • 0 • 0 • Updated Apr 27, 2020
agentmodels-satisfia
Public
Modeling agents with probabilistic programs
TeX • 17 • 0 • 0 • 0 • Updated Sep 4, 2019