Skip to content

Commit

Permalink
Merge branch 'master' into fix-sarsa-gpu
Browse files Browse the repository at this point in the history
  • Loading branch information
muupan authored Aug 29, 2019
2 parents 1938f7f + 0158d0b commit a3cf4b2
Showing 1 changed file with 15 additions and 15 deletions.
30 changes: 15 additions & 15 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -36,21 +36,21 @@ For more information, you can refer to [ChainerRL's documentation](http://chaine

## Algorithms

| Algorithm | Discrete Action | Continous Action | Recurrent Model | CPU Async Training |
|:----------|:---------------:|:----------------:|:---------------:|:------------------:|
| DQN (including DoubleDQN etc.) || ✓ (NAF) || x |
| Categorical DQN || x || x |
| Rainbow || x || x |
| IQN || x || x |
| DDPG | x ||| x |
| A3C |||||
| ACER |||||
| NSQ (N-step Q-learning) || ✓ (NAF) |||
| PCL (Path Consistency Learning) |||||
| PPO |||| x |
| TRPO |||| x |
| TD3 | x || x | x |
| SAC | x || x | x |
| Algorithm | Discrete Action | Continous Action | Recurrent Model | Batch Training | CPU Async Training |
|:----------|:---------------:|:----------------:|:---------------:|:--------------:|:------------------:|
| DQN (including DoubleDQN etc.) || ✓ (NAF) || | x |
| Categorical DQN || x || | x |
| Rainbow || x || | x |
| IQN || x || | x |
| DDPG | x ||| | x |
| A3C ||||(A2C) ||
| ACER |||| x | |
| NSQ (N-step Q-learning) || ✓ (NAF) || x | |
| PCL (Path Consistency Learning) |||| x | |
| PPO |||| | x |
| TRPO |||| | x |
| TD3 | x || x | | x |
| SAC | x || x | | x |

Following algorithms have been implemented in ChainerRL:
- [A2C (Synchronous variant of A3C)](https://openai.com/blog/baselines-acktr-a2c/)
Expand Down

0 comments on commit a3cf4b2

Please sign in to comment.