TensorFlow Agents 1.2.0
Features:
- Use single optimizer for PPO to train shared feature layers better.
- Allow calling methods of the process environment.
Improvements:
- Improve default and MuJoCo configs.
- Report both training and evaluation scores.
Bugs:
- Likelihood calculation halved gradients for the action standard deviation.