A library for reward shaping in reinforcement learning (RL). It includes implementations of the methods from the following papers.
- T. Okudo and S. Yamada, "Subgoal-Based Reward Shaping to Improve Efficiency in Reinforcement Learning," in IEEE Access, vol. 9, pp. 97557-97568, 2021, doi: 10.1109/ACCESS.2021.3090364.
  - You can use this method via `DynamicTrajectoryAggregation` and `SubgoalRS`.
- T. Okudo and S. Yamada, "Reward Shaping with Dynamic Trajectory Aggregation," 2021 International Joint Conference on Neural Networks (IJCNN), 2021, pp. 1-9, doi: 10.1109/IJCNN52387.2021.9533401.
  - You can use this method via `DynamicTrajectoryAggregation` and `SarsaRS` (see the sketch after this list).
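For orientation, the sketch below shows how those two combinations look in code. The `SarsaRS` construction mirrors the usage example later in this README; the `SubgoalRS` argument list is an assumption made for illustration, so check the class definition before relying on it. The bare `AbstractAchiever()` and the hyperparameter values are placeholders.

```python
import shaper
from shaper.achiever.interface import AbstractAchiever
from shaper.aggregator.subgoal_based import DynamicTrajectoryAggregation


def is_success(done, info):
    # Same success predicate as in the usage example below.
    return info.get("is_success", done)


# Placeholder achiever; in practice use a domain-specific subclass (see below).
achiever = AbstractAchiever()
aggregator = DynamicTrajectoryAggregation(achiever)
vfunc = aggregator.create_vfunc()
gamma, lr = 0.99, 0.1  # placeholder discount factor and learning rate

# Subgoal-based reward shaping (IEEE Access 2021).
# NOTE: this argument list is assumed to parallel SarsaRS; verify against the SubgoalRS definition.
subgoal_rs = shaper.SubgoalRS(gamma, lr, aggregator, vfunc, is_success=is_success)

# Reward shaping with dynamic trajectory aggregation (IJCNN 2021), as in the usage example below.
sarsa_rs = shaper.SarsaRS(gamma, lr, aggregator, vfunc, is_success=is_success)
```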
Install the library in editable mode from the repository root:

```bash
pip install -e .
```
You can use the library as in the following script. Examples of domain-specific achievers are available in the `examples` directory.
```python
import gym

import shaper
from shaper.achiever.interface import AbstractAchiever
from shaper.aggregator.subgoal_based import DynamicTrajectoryAggregation


# How to create the reward shaping instance.
def is_success(done, info):
    if "is_success" in info:
        return info["is_success"]
    return done


# The achiever is domain-specific. See the implementations in the "examples" directory.
achiever = AbstractAchiever()
aggregator = DynamicTrajectoryAggregation(achiever)
vfunc = aggregator.create_vfunc()
gamma = 0.99  # discount factor (placeholder value)
lr = 0.1      # learning rate of the shaping value function (placeholder value)
rs = shaper.SarsaRS(gamma, lr, aggregator, vfunc, is_success=is_success)

# How to use it in the RL loop.
env = gym.make("CartPole-v1")
pre_obs = env.reset()
for i in range(100):
    action = env.action_space.sample()
    obs, reward, done, info = env.step(action)
    shaping_reward = rs.step(pre_obs, action, reward, obs, done, info)
    # The shaping reward is typically added to the environment reward in the agent's update.
    pre_obs = obs
```
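The achiever is where domain knowledge enters: it decides whether the current subgoal has been reached. The sketch below is illustrative only; the method name `eval` and its `(obs, current_state)` signature are assumptions, so consult `shaper/achiever/interface.py` and the implementations in the `examples` directory for the actual interface.

```python
from shaper.achiever.interface import AbstractAchiever


class CartPoleAchiever(AbstractAchiever):
    """Hypothetical achiever: subgoals are progressively tighter pole-angle bounds."""

    def __init__(self, angle_bounds=(0.10, 0.05)):
        self.angle_bounds = list(angle_bounds)

    def eval(self, obs, current_state):
        # ASSUMPTION: the interface expects an `eval(obs, current_state)` method that
        # returns True when the subgoal indexed by `current_state` is achieved.
        if current_state >= len(self.angle_bounds):
            return False
        return abs(obs[2]) <= self.angle_bounds[current_state]
```

An instance of such a class would replace the bare `AbstractAchiever()` in the script above, e.g. `aggregator = DynamicTrajectoryAggregation(CartPoleAchiever())`.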
The library provides the following aggregators and reward-shaping classes:
- `DynamicTrajectoryAggregation`
- `Discretizer`
- `SarsaRS`
- `SubgoalRS`
- `NaiveRS`