tf_A3C_BipedalWalker

The BipedalWalker environment from OpenAI Gym, solved with the Asynchronous Advantage Actor-Critic (A3C) algorithm using TensorFlow.
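As a rough illustration of the "advantage" part of A3C, the sketch below computes discounted n-step returns bootstrapped from a critic estimate, then subtracts the critic's values to get advantages. It is not taken from this repository: the function names, the `gamma` default, and the use of plain Python lists are assumptions made for the example.

```python
def discounted_returns(rewards, bootstrap_value, gamma=0.99):
    """Compute n-step discounted returns, working backwards from a bootstrap value.

    `bootstrap_value` is the critic's estimate for the state after the last
    reward (use 0.0 if the episode terminated). `gamma` is an assumed default.
    """
    returns = []
    running = bootstrap_value
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def advantages(rewards, values, bootstrap_value, gamma=0.99):
    """Advantage at each step = discounted return minus the critic's value estimate."""
    returns = discounted_returns(rewards, bootstrap_value, gamma)
    return [g - v for g, v in zip(returns, values)]
```

In a typical actor-critic setup, the advantages weight the policy-gradient term for the actor, while the returns serve as regression targets for the critic.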

The agent was trained for about 30,000 episodes per worker in ~21 hours on a single CPU, using 4 workers.
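The worker layout can be pictured with a minimal threading sketch: several workers each run their own episodes and push updates to one shared model. Everything here is hypothetical and only illustrates the structure. `SharedModel`, `worker`, and `train` are made-up names, the episode count is a tiny placeholder, and the lock is a simplification (asynchronous implementations often apply updates without one).

```python
import threading

NUM_WORKERS = 4          # matches the 4 workers reported above
EPISODES_PER_WORKER = 5  # tiny placeholder; the README reports about 30,000


class SharedModel:
    """Hypothetical stand-in for the global network's parameters."""

    def __init__(self):
        self.updates = 0
        self._lock = threading.Lock()

    def apply_gradients(self, grads):
        # A real implementation would update shared weights here;
        # this sketch only counts how many updates arrived.
        with self._lock:
            self.updates += 1


def worker(model, episodes):
    for _ in range(episodes):
        # Placeholder for: sync local network, roll out an episode,
        # compute gradients from the collected experience.
        model.apply_gradients(grads=None)


def train(num_workers=NUM_WORKERS, episodes=EPISODES_PER_WORKER):
    model = SharedModel()
    threads = [
        threading.Thread(target=worker, args=(model, episodes))
        for _ in range(num_workers)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return model.updates
```

Calling `train()` returns the total number of updates applied, which equals the number of workers times the episodes per worker.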

BipedalWalker-v2 with A3C Algorithm