Skip to content

Implementation of Policy Gradient methods for Reinforcement Learning

Notifications You must be signed in to change notification settings

evgenii-nikishin/pg_pytorch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

52 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A library with implementations of popular Policy Gradient methods for Reinforcement Learning

Includes the following algorithms:

The code is organized as follows:

  • src/agent.py: implementations of Agent.
  • src/env_wrappers.py: wrapper for multiple parallel environments
  • src/kfac.py: implementation of K-FAC optimizer, compatible with torch.optim.Optimizer
  • src/networks.py: neural network architectures of actor and critic for different environments
  • src/trpo.py: implementation of TRPO-optimizer routines
  • src/utils.py: utils for models and optimizers

TODO:

  • effective vectorization with n-step returns
  • PPO
  • A3C

Code is developed and supported by:


Inherited from https://github.com/nikishin-evg/acktr_pytorch

About

Implementation of Policy Gradient methods for Reinforcement Learning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published