Reinforcement Learning: An Introduction

Python code for Sutton & Barto's book Reinforcement Learning: An Introduction (2nd Edition)

If you are confused about the code or want to report a bug, please open an issue instead of emailing me directly.

Figure 9.1: Gradient Monte Carlo algorithm on the 1000-state random walk task
Figure 9.2: Semi-gradient n-step TD algorithm on the 1000-state random walk task
Figure 9.5: Fourier basis vs polynomials on the 1000-state random walk task
Figure 9.8: Example of feature width’s effect on initial generalization and asymptotic accuracy
Figure 9.10: Single tiling and multiple tilings on the 1000-state random walk task
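
For orientation, here is a minimal sketch of the idea behind Figure 9.1: gradient Monte Carlo with state aggregation on the 1000-state random walk. It is not the code in chapter09. The names `step` and `gradient_mc` are hypothetical, and the task settings (start in state 500, jumps of up to 100 states, 10 groups of 100 states, rewards of -1 and +1 at the left and right ends) are taken from the book's description of the task as I understand it.

```python
import random

N_STATES = 1000    # non-terminal states are numbered 1..N_STATES
START = 500        # every episode starts near the center
MAX_JUMP = 100     # each step jumps 1..100 states to the left or right
GROUP_SIZE = 100   # state aggregation: 10 groups of 100 states each


def step(state, rng):
    """Take one random-walk step and return (next_state, reward, done)."""
    jump = rng.randint(1, MAX_JUMP) * rng.choice((-1, 1))
    nxt = state + jump
    if nxt < 1:            # walked off the left edge
        return None, -1.0, True
    if nxt > N_STATES:     # walked off the right edge
        return None, 1.0, True
    return nxt, 0.0, False


def gradient_mc(episodes, alpha=2e-5, seed=0):
    """Estimate one value per group of states with gradient Monte Carlo."""
    rng = random.Random(seed)
    w = [0.0] * (N_STATES // GROUP_SIZE)
    for _ in range(episodes):
        state, visited, done = START, [], False
        while not done:
            visited.append(state)
            state, reward, done = step(state, rng)
        # The task is undiscounted and only the final reward is nonzero,
        # so the return from every visited state equals that final reward.
        for s in visited:
            g = (s - 1) // GROUP_SIZE
            # With state aggregation the gradient is 1 for the active group.
            w[g] += alpha * (reward - w[g])
    return w


if __name__ == "__main__":
    print(gradient_mc(episodes=1000, alpha=1e-3))
```

The step size and episode count in the `__main__` block are chosen only to keep the run short; smooth curves need far more episodes and a much smaller step size.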

Chapter 10

Chapter 11

Chapter 12

Environment

Python 2 or Python 3
NumPy
Matplotlib
Six
Seaborn
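
A quick way to confirm that these packages are installed before running a script is sketched below. It is an illustration, not part of the repository: the helper name `missing_packages` is hypothetical, the import names are assumed to match the packages listed above, and it relies on `importlib.util.find_spec`, so it assumes Python 3.

```python
import importlib.util

# Import names assumed to correspond to the packages listed above.
REQUIRED = ["numpy", "matplotlib", "six", "seaborn"]


def missing_packages(names=REQUIRED):
    """Return the names from `names` that cannot be found on the import path."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages: " + ", ".join(missing))
    else:
        print("All listed packages are importable.")
```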

Usage

git clone https://github.com/ShangtongZhang/reinforcement-learning-an-introduction.git
cd reinforcement-learning-an-introduction/chapterXX
python XXX.py

Contribution

This project contains almost all the programmable figures in the book. However, when I completed this project, the book was still in draft and some chapters were still incomplete. Furthermore, due to the limited computational capacity of my machine, I could use only a limited number of runs and episodes for some experiments, so the sample output is much less smooth than that in the book.

If you want to contribute exercises from the book or missing examples, fix bugs in the existing code, provide higher-quality sample outputs, or add interesting new experiments related to RL, feel free to open an issue or make a pull request. I will appreciate it very much. Also, feel free to comment on the sample outputs; some curves are really interesting.

The following figures/examples are known to be missing:

Example 3.4: Pole-Balancing
Example 3.6: Draw Poker
Example 5.2: Soap Bubble
Example 8.5: Rod Maneuvering
Figure 12.14: The effect of λ (I don't have time to replicate it for now)
Chapters 14 & 15, which are about psychology and neuroscience
Chapter 16: Backgammon, The Acrobot, Go
