This example reproduces the DDPG (Deep Deterministic Policy Gradient) deep reinforcement learning algorithm with PARL and reaches the same level of performance as the paper on the MuJoCo benchmarks.
Please see here to learn more about the MuJoCo games.
- python3.5+
- paddlepaddle>=1.5.1
- parl
- gym
- tqdm
- mujoco-py>=1.50.1.0
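
Before training, it can help to confirm that the packages listed above are importable. The following is a minimal sketch and not part of the original example: the helper name `find_missing` is hypothetical, and the import names (`paddle` for paddlepaddle, `mujoco_py` for mujoco-py) are assumed to be the usual ones for these packages. It only checks that each module can be located; it does not verify the minimum versions.

```python
import importlib.util

# Import names assumed for the packages listed above
# (paddlepaddle is imported as `paddle`, mujoco-py as `mujoco_py`).
REQUIRED_MODULES = ["paddle", "parl", "gym", "tqdm", "mujoco_py"]


def find_missing(modules):
    """Return the module names that cannot be located in this environment."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```
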
# Train an agent on the HalfCheetah-v2 environment
python train.py
# To train on a different environment
# python train.py --env [ENV_NAME]
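
The two commands above imply that the environment name is an optional command-line flag that falls back to HalfCheetah-v2. A minimal sketch of how such a flag could be parsed is shown here; it is an assumption for illustration, not the actual contents of `train.py`, and the function name `parse_args` and the example environment `Hopper-v2` are hypothetical choices.

```python
import argparse


def parse_args(argv=None):
    """Parse command-line options (illustrative; not the real train.py)."""
    parser = argparse.ArgumentParser(description="Train a DDPG agent.")
    # Assumed default, matching the behavior of a plain `python train.py`.
    parser.add_argument(
        "--env",
        default="HalfCheetah-v2",
        help="Name of the MuJoCo environment to train on.",
    )
    return parser.parse_args(argv)


# No flag given: falls back to the default environment.
default_args = parse_args([])
print(default_args.env)

# Flag given: the named environment is used instead.
custom_args = parse_args(["--env", "Hopper-v2"])
print(custom_args.env)
```

Passing an explicit argument list to `parse_args` keeps the sketch easy to try out without touching the real command line.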