You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Fixed an issue with cartpole_swingup's best_episode not being computed correctly due to negative rewards (beb1630). This change will affect scores for this experiment, so we bump the minor version in keeping with semantic versioning.
Fix versioning issues in colab and setup.py.
Improve test.sh coverage.
Make TAGS accessible as sweep (e.g. sweep.TAGS['exploration'] to retrieve all exploration experiments.
Add a bootstrapped DQN baseline in JAX.
Extract environments to their own package for easier direct access for debugging/development.