- Contributors and Collaborators: Tianbing Xu (Baidu Research, CA), Qiang Liu (UT Austin), Jian Peng (UIUC)
Policy gradient estimates obtained from simulation often have excessive variance, which leads to poor sample efficiency. In this paper, we apply stochastic variance-reduced gradient (SVRG) estimation to model-free policy gradients to improve sample efficiency. The SVRG estimate is incorporated into a trust-region Newton conjugate gradient framework for policy optimization.
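To illustrate the core idea, the sketch below shows the generic SVRG control variate of Johnson & Zhang (2013) applied to minibatch policy-gradient estimates. It is a minimal illustration, not the implementation in this repo: the function and variable names (`grad_minibatch`, `mu_snapshot`, etc.) are hypothetical, and the importance-weighting correction and trust-region Newton-CG update described in the paper are omitted.

```python
import numpy as np

def svrg_policy_gradient(grad_minibatch, theta, theta_snapshot, mu_snapshot, batch):
    """SVRG-style variance-reduced gradient estimate (illustrative sketch only)."""
    g_current = grad_minibatch(theta, batch)            # noisy minibatch gradient at the current policy
    g_snapshot = grad_minibatch(theta_snapshot, batch)  # same minibatch, evaluated at the snapshot policy
    # Control variate: the two minibatch terms cancel in expectation, so the
    # estimate stays unbiased while its variance shrinks near the snapshot.
    return g_current - g_snapshot + mu_snapshot

# Toy usage with a quadratic surrogate objective (purely illustrative):
grad = lambda theta, batch: 2.0 * (theta - batch.mean(axis=0))
theta_snapshot = np.zeros(3)
full_batch = np.random.randn(1000, 3)
mu_snapshot = grad(theta_snapshot, full_batch)          # full-batch gradient at the snapshot
v = svrg_policy_gradient(grad, np.ones(3), theta_snapshot, mu_snapshot, full_batch[:32])
```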
Dependencies:
- rllab (https://github.com/rll/rllab)
- Python 3.6
- The usual suspects: NumPy, SciPy, Matplotlib
- TensorFlow
- gym (see the OpenAI Gym installation instructions)
- MuJoCo (30-day trial available and free to students)
Refer to requirements.txt for more details.
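Once the dependencies are installed, a quick sanity check like the following (a hypothetical snippet, not part of the repo) confirms that the core packages import and that a MuJoCo-backed gym environment loads:

```python
# Verify that the core dependencies import and a MuJoCo-backed gym env loads.
import numpy, scipy, matplotlib   # noqa: F401
import tensorflow as tf
import gym

print("TensorFlow version:", tf.__version__)
env = gym.make("Swimmer-v1")      # env id may differ depending on the gym version
print("Swimmer observation space:", env.observation_space)
```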
Setup and run:
- After activating the virtual environment, set up `PYTHONPATH` and the MuJoCo path:

```bash
source start.sh
```
- Run an experiment (a sketch of a typical launcher follows below):

```bash
cd sandbox/rocky/tf/launchers/
python trpo_gym_swimmer.py
```
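The launcher used here is the SVRG-enabled variant from this repo. For orientation, a plain TRPO Swimmer launcher in rllab's TF sandbox typically looks like the minimal sketch below; the module paths follow stock rllab, and the actual contents of `trpo_gym_swimmer.py` may differ (e.g., hyperparameters, or swapping in the SVRG estimator).

```python
from rllab.baselines.linear_feature_baseline import LinearFeatureBaseline
from rllab.envs.gym_env import GymEnv
from rllab.envs.normalized_env import normalize
from sandbox.rocky.tf.algos.trpo import TRPO
from sandbox.rocky.tf.envs.base import TfEnv
from sandbox.rocky.tf.policies.gaussian_mlp_policy import GaussianMLPPolicy

# Wrap the gym Swimmer task so rllab's TF algorithms can consume it.
env = TfEnv(normalize(GymEnv("Swimmer-v1")))

policy = GaussianMLPPolicy(name="policy", env_spec=env.spec, hidden_sizes=(32, 32))
baseline = LinearFeatureBaseline(env_spec=env.spec)

algo = TRPO(
    env=env,
    policy=policy,
    baseline=baseline,
    batch_size=5000,       # state-action samples collected per iteration
    max_path_length=500,
    n_itr=100,
    discount=0.99,
    step_size=0.01,        # KL trust-region size
)
algo.train()
```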
References:
- Tianbing Xu, Qiang Liu, Jian Peng, "Stochastic Variance Reduction for Policy Gradient Estimation", arXiv, 2017
- S. S. Du, J. Chen, L. Li, L. Xiao, and D. Zhou, "Stochastic Variance Reduction Methods for Policy Evaluation", ICML, 2017
- R. Johnson and T. Zhang, "Accelerating Stochastic Gradient Descent Using Predictive Variance Reduction", NIPS, 2013
- A. Owen and Y. Zhou, "Safe and Effective Importance Sampling", JASA, 2000
- Yan Duan, Xi Chen, Rein Houthooft, John Schulman, Pieter Abbeel, "Benchmarking Deep Reinforcement Learning for Continuous Control", ICML, 2016