Pong

Trains a Pong agent using policy gradients on OpenAI Gym. The code was copied from Andrej Karpathy's Deep Reinforcement Learning: Pong from Pixels, and almost all changes to it are cosmetic. Please refer to Karpathy's walkthrough to learn more about the implementation!

Usage

python3 pong.py

Set resume = True in pong.py to continue training the agent from the state saved in model.p. Set resume = False to start training from scratch.
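For illustration, the resume switch can be sketched roughly as follows. This is a minimal sketch, not the code in pong.py: the helper names `load_or_init_model` and `save_model`, the two-layer weight layout (`W1`, `W2`), the layer sizes, and the fallback to fresh weights when model.p is missing are all assumptions made for the example.

```python
import os
import pickle

import numpy as np


def load_or_init_model(resume, path="model.p", hidden=200, inputs=80 * 80, seed=0):
    """Return pickled weights when resuming, otherwise freshly initialized ones.

    Hypothetical helper: the layer sizes and dict keys are illustrative
    assumptions and may not match pong.py.
    """
    if resume and os.path.exists(path):
        print(f"Resuming model '{path}'...")
        with open(path, "rb") as f:
            return pickle.load(f)
    # Start from scratch: small random weights scaled by 1/sqrt(fan-in).
    rng = np.random.default_rng(seed)
    return {
        "W1": rng.standard_normal((hidden, inputs)) / np.sqrt(inputs),
        "W2": rng.standard_normal(hidden) / np.sqrt(hidden),
    }


def save_model(model, path="model.p"):
    """Write the weights to disk so that a later run can resume from them."""
    with open(path, "wb") as f:
        pickle.dump(model, f)
```

In this sketch, a training loop would call `save_model(model)` periodically so that a later run with resume = True picks up the most recent weights.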

Output

Resuming model 'model.p'...
ep 0: game finished, reward: 1.000000
ep 0: game finished, reward: 1.000000
ep 0: game finished, reward: 1.000000
ep 0: game finished, reward: -1.000000
ep 0: game finished, reward: 1.000000
ep 0: game finished, reward: -1.000000
...
...
...
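Each "game finished" line above reports the +1 or -1 reward for a single point. In a policy-gradient setup like the one in Karpathy's walkthrough, these sparse rewards are typically spread back over the preceding timesteps as discounted returns before the gradient update. The sketch below shows one way this could look. The function name, the default `gamma`, and the reset at every nonzero reward are assumptions for illustration and may differ from pong.py.

```python
import numpy as np


def discount_rewards(rewards, gamma=0.99):
    """Turn per-timestep rewards into discounted returns.

    Illustrative sketch: the running sum is reset whenever a point is
    scored (nonzero reward), so returns do not leak across Pong points.
    """
    rewards = np.asarray(rewards, dtype=np.float64)
    discounted = np.zeros_like(rewards)
    running = 0.0
    # Walk backwards so each step accumulates the discounted future reward.
    for t in reversed(range(len(rewards))):
        if rewards[t] != 0:
            running = 0.0  # point boundary: start a fresh sum
        running = running * gamma + rewards[t]
        discounted[t] = running
    return discounted
```

For example, `discount_rewards([0, 0, 1, 0, -1], gamma=0.5)` gives `[0.25, 0.5, 1.0, -0.5, -1.0]`: the steps leading up to the won point receive positive credit, and the step before the lost point receives negative credit.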
