Minimal implementation of Proximal Policy Optimization (PPO) in PyTorch
- Supports both discrete and continuous action spaces.
- In continuous action spaces, actions are sampled from a Gaussian with a constant standard deviation (see the sketch after this list).
- Utilities for plotting learning curves in TensorBoard.
- 2023-09-09: Added Generative Adversarial Imitation Learning (GAIL).
Find or make a config file, then run the commands below to train, build a GAIL expert dataset, and evaluate.
Train:

```bash
python main.py --config=configs/Ant-v4.yaml \
               --exp_name=test \
               --train
```
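The config schema is specific to this repo, so the snippet below is only a guess at the kind of fields such a file tends to contain; every key here is an assumption to be checked against the real `configs/Ant-v4.yaml`.

```yaml
# Illustrative config sketch -- all keys are guesses; see configs/Ant-v4.yaml.
env_name: Ant-v4      # Gymnasium environment id
gamma: 0.99           # discount factor
clip_eps: 0.2         # PPO clipping range
lr: 3.0e-4            # learning rate
action_std: 0.5       # constant std for continuous actions
```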
Build a GAIL expert dataset from a trained model:

```bash
python make_expert_dataset.py --experiment_path=checkpoints/Ant/test \
                              --load_postfix=last \
                              --minimum_score=5000 \
                              --n_episode=30
```
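The internals of `make_expert_dataset.py` are not shown in this README; one plausible reading of its flags is the loop below, which rolls out the trained policy and keeps only episodes whose return clears `--minimum_score`. The `actor.act()` interface is a hypothetical stand-in for the trained PPO actor.

```python
import pickle
import gymnasium as gym

def collect_expert_dataset(actor, env_name="Ant-v4",
                           n_episode=30, minimum_score=5000,
                           out_path="expert_dataset.pkl"):
    """Roll out a trained policy; keep only episodes above minimum_score."""
    env = gym.make(env_name)
    dataset = {"obs": [], "act": []}
    kept = 0
    while kept < n_episode:
        obs, _ = env.reset()
        obs_buf, act_buf, ep_ret, done = [], [], 0.0, False
        while not done:
            act = actor.act(obs)                     # hypothetical helper
            obs_buf.append(obs)
            act_buf.append(act)
            obs, rew, terminated, truncated, _ = env.step(act)
            ep_ret += rew
            done = terminated or truncated
        if ep_ret >= minimum_score:                  # filter out weak episodes
            dataset["obs"] += obs_buf
            dataset["act"] += act_buf
            kept += 1
    with open(out_path, "wb") as f:
        pickle.dump(dataset, f)
```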
Evaluate and record videos:

```bash
python main.py --experiment_path=checkpoints/Ant/test \
               --eval \
               --eval_n_episode=50 \
               --load_postfix=last \
               --video_path=videos/Ant
```
- load_postfix: postfix of the saved model to load (e.g. an episode number, 'best', or 'last').
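Putting `--load_postfix` and `--video_path` together, evaluation with recorded videos can be sketched with Gymnasium's `RecordVideo` wrapper. The checkpoint filename pattern and the `actor.act()` interface below are assumptions, not this repo's actual layout.

```python
import gymnasium as gym
import torch
from gymnasium.wrappers import RecordVideo

def evaluate(actor, experiment_path="checkpoints/Ant/test",
             load_postfix="last", n_episode=50, video_path="videos/Ant"):
    # Hypothetical checkpoint layout: <experiment_path>/<postfix>.pt
    state = torch.load(f"{experiment_path}/{load_postfix}.pt",
                       map_location="cpu")
    actor.load_state_dict(state)

    env = gym.make("Ant-v4", render_mode="rgb_array")
    # Record every evaluation episode under video_path.
    env = RecordVideo(env, video_folder=video_path,
                      episode_trigger=lambda ep: True)
    returns = []
    for _ in range(n_episode):
        obs, _ = env.reset()
        done, ep_ret = False, 0.0
        while not done:
            obs, rew, terminated, truncated, _ = env.step(actor.act(obs))
            ep_ret += rew
            done = terminated or truncated
        returns.append(ep_ret)
    env.close()
    return sum(returns) / len(returns)
```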
| Environment | Performance Chart | Evaluation Video |
|---|---|---|
| Ant-v4 | *(learning curve)* | ant.mp4 |
| Ant-v4 (GAIL) | *(learning curve)* | ant_gail.mp4 |
| Reacher-v4 | *(learning curve)* | reacher.mp4 |
| HalfCheetah-v4 | *(learning curve)* | cheetah.mp4 |
- Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO (Engstrom et al., ICLR 2020)
- https://github.com/junkwhinger/PPO_PyTorch
- https://github.com/nikhilbarhate99/PPO-PyTorch