The master branch adds several changes to the version described in the 2018 NeurIPS paper, significantly improving both runtime and algorithmic performance. The primary changes are detailed below:
- Parallelized rollouts
- Soft Actor-Critic (SAC) replacing the DDPG used originally
- Support for discrete environments using a form of DDQN + Maximum Entropy Reinforcement Learning
Please switch to the neurips_paper_2018 branch if you wish to reproduce the original results from the paper.
Python 3.6.9
PyTorch 1.2
NumPy 1.18.1
Gym 0.15.6
Mujoco-py v1.50.1.59
python --env
Write a gym-compatible wrapper around your environment and register it with the gym runtime.
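
As a rough illustration of that step, here is a minimal sketch of a custom environment following the Gym 0.15-style interface (`reset()` returns an observation, `step()` returns `(observation, reward, done, info)`). The class `PointMassEnv`, its dynamics, and the id `PointMass-v0` are hypothetical names made up for this example and are not part of this repository.

```python
import gym
import numpy as np
from gym import spaces
from gym.envs.registration import register


class PointMassEnv(gym.Env):
    """Hypothetical 1-D task: push a point toward the origin."""

    def __init__(self, max_steps=50):
        # Continuous action in [-1, 1]; the observation is the point's position.
        self.action_space = spaces.Box(low=-1.0, high=1.0, shape=(1,), dtype=np.float32)
        self.observation_space = spaces.Box(low=-10.0, high=10.0, shape=(1,), dtype=np.float32)
        self.max_steps = max_steps
        self._pos = np.zeros(1, dtype=np.float32)
        self._t = 0

    def reset(self):
        # Gym 0.15-style reset: returns only the initial observation.
        self._pos = np.array([1.0], dtype=np.float32)
        self._t = 0
        return self._pos.copy()

    def step(self, action):
        # Gym 0.15-style step: returns (observation, reward, done, info).
        action = np.clip(np.asarray(action, dtype=np.float32).reshape(1), -1.0, 1.0)
        self._pos = np.clip(self._pos + 0.1 * action, -10.0, 10.0).astype(np.float32)
        self._t += 1
        reward = -float(abs(self._pos[0]))  # closer to the origin is better
        done = self._t >= self.max_steps
        return self._pos.copy(), reward, done, {}


# Register the environment under a hypothetical id so gym can construct it by name.
register(id="PointMass-v0", entry_point=PointMassEnv)
```

With Gym 0.15.6, `gym.make("PointMass-v0")` should then return an instance of the environment. Whether that id can be passed directly to the `--env` flag depends on how the training script resolves environment names, so treat that as an assumption to verify. The module containing the `register` call must be imported before the environment is created.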