PPO benchmark on brax environments

We benchmarked PPO on several complex continuous control tasks coming from Brax suit (https://github.com/google/brax). We provide in this section results and hyperparameters to reproduce the experiments in GPU (both acquisition and loss computation).

Results

The following curves are obtained by with the hyperparameters we provide in the yaml files (averaged over 5 different seeds). Run are bout 30 minutes using on GPU.

Running experiments

Make sure your version of Brax is up to date (https://github.com/google/brax)
If you want to change the env tested, choose the correct yaml file name in ppo.py (line 117)
Run OMP_NUM_THREADS=1 XLA_PYTHON_CLIENT_PREALLOCATE=false python ppo.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

PPO benchmark on brax environments

Results

Running experiments

Files

README.md

Latest commit

History

README.md

File metadata and controls

PPO benchmark on brax environments

Results

Running experiments