GitHub - Phrungck/reinforcement-learning-models: Simple implementation and comparison of three reinforcement learning models.

Reinforcement Learning Algorithms (2021 coding-style)

This repository implements Q-Learning, SARSA, and the Cross-Entropy Method using numpy and torch, and compares their performance on three environments: deterministic FrozenLake, stochastic FrozenLake, and CliffWalking.
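As a point of reference, the difference between the two tabular methods can be sketched in a few lines. This is a minimal illustration of the standard textbook update rules, not the repository's code: the function names, the list-of-lists Q-table, and the toy values are all assumptions made for the example, and terminal-state handling is omitted.

```python
def q_learning_update(Q, s, a, r, s_next, alpha, gamma):
    """Off-policy TD update: bootstrap from the best action in s_next."""
    target = r + gamma * max(Q[s_next])
    Q[s][a] += alpha * (target - Q[s][a])


def sarsa_update(Q, s, a, r, s_next, a_next, alpha, gamma):
    """On-policy TD update: bootstrap from the action actually taken next."""
    target = r + gamma * Q[s_next][a_next]
    Q[s][a] += alpha * (target - Q[s][a])


def make_q_table(n_states, n_actions):
    """Hypothetical helper: a Q-table of zeros as nested lists."""
    return [[0.0] * n_actions for _ in range(n_states)]


# Toy setup (assumed values): two states, two actions.
# In state 1, action 1 is valued at 1.0 and action 0 at 0.0.
Q_ql = make_q_table(2, 2)
Q_ql[1] = [0.0, 1.0]
Q_sarsa = [row[:] for row in Q_ql]

q_learning_update(Q_ql, s=0, a=0, r=0.0, s_next=1, alpha=0.5, gamma=0.9)
# Here SARSA's next action is the lower-valued one, e.g. an exploratory step.
sarsa_update(Q_sarsa, s=0, a=0, r=0.0, s_next=1, a_next=0, alpha=0.5, gamma=0.9)
```

With these toy values, Q-Learning raises its estimate for state 0 because it assumes the best next action, while SARSA leaves its estimate unchanged because the action actually taken next has value zero.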

Dependencies

OpenAI gym
matplotlib
numpy
collections (Python standard library)
torch
itertools (Python standard library)
plotting

Deterministic Frozenlake Results

Stochastic Frozenlake Results

Cliffwalking Results

Changing Parameters

In all runs, SARSA and Q-Learning outperformed the Cross-Entropy Method on the CliffWalking environment. Changing the hyperparameters changed the results significantly. Notably, increasing the alpha parameter allowed Q-Learning and SARSA to exceed the baseline results.
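To make the role of the two hyperparameters concrete, the sketch below applies a single temporal-difference step with different settings. It assumes the conventional meaning of alpha (learning rate) and gamma (discount factor); `td_step` is a hypothetical helper, and the numbers are illustrative only, not the repository's settings or results.

```python
def td_step(q_sa, reward, next_value, alpha, gamma):
    """Return the updated Q(s, a) after one temporal-difference step."""
    td_target = reward + gamma * next_value
    return q_sa + alpha * (td_target - q_sa)


# Illustrative numbers only (assumed, not taken from the experiments).
q_sa, reward, next_value = 0.0, -1.0, 10.0

small_alpha = td_step(q_sa, reward, next_value, alpha=0.1, gamma=0.9)
large_alpha = td_step(q_sa, reward, next_value, alpha=0.5, gamma=0.9)
low_gamma = td_step(q_sa, reward, next_value, alpha=0.5, gamma=0.5)
```

A larger alpha moves the estimate further toward the target in one step, and a smaller gamma shrinks the contribution of the next state's value to that target.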

Increasing alpha while reducing gamma resulted in nearly identical values across all variants of Q-Learning and SARSA. However, the Cross-Entropy Method became more erratic in the process.
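For context on the Cross-Entropy Method, its core step in the common percentile-based formulation is selecting "elite" episodes from each batch and then fitting the policy to the actions taken in them. The sketch below shows only that selection step. The `select_elites` helper, the nearest-rank threshold, and the sample batch are assumptions for illustration; the repository's implementation may differ.

```python
def select_elites(episodes, percentile=70):
    """Keep episodes whose total reward is at or above a percentile cutoff.

    `episodes` is a list of (total_reward, trajectory) pairs.
    Returns the reward threshold and the elite trajectories.
    """
    rewards = sorted(r for r, _ in episodes)
    # Nearest-rank style cutoff within the sorted rewards.
    idx = min(len(rewards) - 1, int(len(rewards) * percentile / 100))
    threshold = rewards[idx]
    elites = [traj for r, traj in episodes if r >= threshold]
    return threshold, elites


# Hypothetical batch: episode returns paired with placeholder trajectories.
batch = [(-100, "ep0"), (-13, "ep1"), (-50, "ep2"), (-17, "ep3"), (-200, "ep4")]
threshold, elites = select_elites(batch, percentile=60)
```

Because the threshold is recomputed for every batch, the set of episodes used for training depends on how rewards happen to be distributed in that batch.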
