Heron

This is the implementation of HERON, a method for reward design from weak reward signals. This repository contains code for four different experiments: Classic Control, Robotics, Code Generation, and Multi-Agent Traffic Light Control.

Installation

For classic control, run the following commands:

pip install -r ClassicControl/requirements/requirements.txt
pip install -e gym

For robotics and code generation, follow the instructions in the respective folders (robots and RL-codegen).

Running Experiments

To run experiments in the Classic Control settings, run the following commands from the repository root:

cd ClassicControl
bash mountain.sh
bash pendulum.sh

To run experiments in the Robotics environments, run the following commands from the repository root:

cd robots
bash run_files/ant.sh
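
Both command sequences above begin with a cd, so pasting them one after the other into the same shell would try to enter robots from inside ClassicControl. The sketch below is one way around that, assuming both folders sit at the repository root. The run_in helper is hypothetical and not part of this repository; it runs a script inside a subshell so the caller's working directory never changes.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the HERON repository):
# run a script from inside a given directory, then return to where we started.
run_in() {
  local dir="$1"
  shift
  # The parentheses start a subshell, so the cd does not affect the caller.
  ( cd "$dir" && bash "$@" )
}

# Usage, assuming the current directory is the repository root:
#   run_in ClassicControl mountain.sh
#   run_in ClassicControl pendulum.sh
#   run_in robots run_files/ant.sh
```

If a script fails or the directory does not exist, run_in returns a non-zero status, so the calls can be chained with && to stop at the first failure.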
