By the end of this lab you are expected to:
- Have a quick review of Lecture 01.
- Understand the fundamental concepts of reinforcement learning and learn to test your algorithms with OpenAI Gym.
- Train your first reinforcement learning model using stable-baselines3; evaluate and test it, use callbacks, and learn how to save and load RL models (see the sketch below).
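A minimal sketch of that workflow, assuming the CartPole-v1 environment and the PPO algorithm (any other stable-baselines3 algorithm follows the same pattern; depending on your stable-baselines3 version you may need `gymnasium` instead of `gym`):

```python
import gym
from stable_baselines3 import PPO
from stable_baselines3.common.evaluation import evaluate_policy
from stable_baselines3.common.callbacks import EvalCallback

# Create the environment (CartPole-v1 is only an example choice)
env = gym.make("CartPole-v1")

# Train a PPO agent; the callback periodically evaluates it and keeps the best model
eval_callback = EvalCallback(gym.make("CartPole-v1"),
                             best_model_save_path="./logs/",
                             eval_freq=5_000)
model = PPO("MlpPolicy", env, verbose=1)
model.learn(total_timesteps=50_000, callback=eval_callback)

# Evaluate the trained policy
mean_reward, std_reward = evaluate_policy(model, env, n_eval_episodes=10)
print(f"mean reward: {mean_reward:.1f} +/- {std_reward:.1f}")

# Save and reload the model
model.save("ppo_cartpole")
model = PPO.load("ppo_cartpole", env=env)
```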
In this lab you will implement several exploration strategies for the simplest bandit problem: the Bernoulli bandit.
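As an illustration of one such strategy, here is a minimal epsilon-greedy sketch for a Bernoulli bandit; the arm probabilities and hyperparameters are hypothetical, not the lab's exact interface:

```python
import numpy as np

rng = np.random.default_rng(0)
true_probs = np.array([0.3, 0.5, 0.7])   # hypothetical success probability of each arm
n_actions = len(true_probs)
counts = np.zeros(n_actions)             # number of pulls per arm
values = np.zeros(n_actions)             # running mean reward per arm
epsilon = 0.1

for step in range(10_000):
    # Explore with probability epsilon, otherwise exploit the current best estimate
    if rng.random() < epsilon:
        a = int(rng.integers(n_actions))
    else:
        a = int(np.argmax(values))
    reward = float(rng.random() < true_probs[a])   # Bernoulli reward
    counts[a] += 1
    values[a] += (reward - values[a]) / counts[a]  # incremental mean update

print("estimated arm values:", values)
```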
By the end of this lab you will understand how a Markov Reward Process (MRP) and a Markov Decision Process (MDP) work. You will also apply the direct solution to find the optimal policy.
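For the value-function part, a minimal sketch of the direct solution for a small MRP, assuming a hypothetical 3-state transition matrix and reward vector: the Bellman expectation equation V = R + γPV is solved in closed form as V = (I − γP)⁻¹R.

```python
import numpy as np

# Hypothetical 3-state MRP: transition matrix P, reward vector R, discount gamma
P = np.array([[0.5, 0.5, 0.0],
              [0.0, 0.5, 0.5],
              [0.0, 0.0, 1.0]])
R = np.array([1.0, 2.0, 0.0])
gamma = 0.9

# Solve V = R + gamma * P @ V directly, i.e. V = (I - gamma * P)^{-1} R
V = np.linalg.solve(np.eye(3) - gamma * P, R)
print("state values:", V)
```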
Policy Iteration and Value Iteration algorithms.
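A minimal Value Iteration sketch on a hypothetical random tabular MDP (the transition tensor, rewards, and thresholds are assumptions for illustration only); Policy Iteration alternates the same evaluation and greedy-improvement ideas.

```python
import numpy as np

# Hypothetical tabular MDP: P[s, a, s'] transition probabilities, R[s, a] rewards
n_states, n_actions, gamma, theta = 4, 2, 0.9, 1e-8
rng = np.random.default_rng(0)
P = rng.dirichlet(np.ones(n_states), size=(n_states, n_actions))
R = rng.random((n_states, n_actions))

# Value Iteration: repeatedly apply the Bellman optimality backup until convergence
V = np.zeros(n_states)
while True:
    Q = R + gamma * P @ V          # Q[s, a] = R[s, a] + gamma * sum_s' P[s, a, s'] V[s']
    V_new = Q.max(axis=1)
    if np.max(np.abs(V_new - V)) < theta:
        break
    V = V_new

policy = Q.argmax(axis=1)          # greedy policy w.r.t. the converged values
print("optimal values:", V_new, "greedy policy:", policy)
```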
Monte Carlo for Prediction.
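A minimal every-visit Monte Carlo prediction sketch, assuming an environment with the classic gym `reset()`/`step()` 4-tuple interface and a `policy(state)` function (both are assumptions about the lab's setup):

```python
from collections import defaultdict

def mc_prediction(env, policy, n_episodes=5_000, gamma=0.99):
    """Every-visit Monte Carlo estimate of V_pi from complete episodes."""
    returns_sum = defaultdict(float)
    returns_cnt = defaultdict(int)
    V = defaultdict(float)
    for _ in range(n_episodes):
        episode, state, done = [], env.reset(), False
        while not done:
            action = policy(state)
            next_state, reward, done, _ = env.step(action)
            episode.append((state, reward))
            state = next_state
        G = 0.0
        for state, reward in reversed(episode):   # accumulate returns backwards
            G = reward + gamma * G
            returns_sum[state] += G
            returns_cnt[state] += 1
            V[state] = returns_sum[state] / returns_cnt[state]
    return V
```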
Temporal Difference Prediction.
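A minimal TD(0) prediction sketch under the same assumed gym-style interface; unlike Monte Carlo, it bootstraps from the current estimate of the next state's value after every step.

```python
from collections import defaultdict

def td0_prediction(env, policy, n_episodes=5_000, alpha=0.1, gamma=0.99):
    """TD(0) estimate of V_pi, updated online after every transition."""
    V = defaultdict(float)
    for _ in range(n_episodes):
        state, done = env.reset(), False
        while not done:
            action = policy(state)
            next_state, reward, done, _ = env.step(action)
            # TD(0) update: V(s) <- V(s) + alpha * (r + gamma * V(s') - V(s))
            target = reward + gamma * V[next_state] * (not done)
            V[state] += alpha * (target - V[state])
            state = next_state
    return V
```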
By the end of this lab you will understand the difference between Model-Free Prediction and Model-Free Control, become familiar with On-Policy vs. Off-Policy Learning, and implement the SARSA and Q-Learning algorithms from scratch.
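A minimal tabular Q-Learning sketch, again assuming the classic gym 4-tuple `step()` interface and a discrete action space; the comment marks the single line where on-policy SARSA would differ.

```python
import numpy as np
from collections import defaultdict

def epsilon_greedy(Q, state, n_actions, epsilon, rng):
    if rng.random() < epsilon:
        return int(rng.integers(n_actions))
    return int(np.argmax(Q[state]))

def q_learning(env, n_actions, n_episodes=5_000, alpha=0.1, gamma=0.99, epsilon=0.1):
    """Off-policy control: the target uses max_a Q(s', a), not the action actually taken."""
    rng = np.random.default_rng(0)
    Q = defaultdict(lambda: np.zeros(n_actions))
    for _ in range(n_episodes):
        state, done = env.reset(), False
        while not done:
            action = epsilon_greedy(Q, state, n_actions, epsilon, rng)
            next_state, reward, done, _ = env.step(action)
            target = reward + gamma * np.max(Q[next_state]) * (not done)
            # SARSA (on-policy) would instead use Q[next_state][next_action], where
            # next_action is sampled from the same epsilon-greedy behaviour policy.
            Q[state][action] += alpha * (target - Q[state][action])
            state = next_state
    return Q
```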
- Reinforcement Learning: An Introduction, Sutton & Barto, The MIT Press (1998)
- Artificial Intelligence: A Modern Approach, Russell & Norvig, 4th US ed.
- https://hadovanhasselt.com/2016/01/12/ucl-course/
- https://github.com/huggingface/deep-rl-class
- https://github.com/nicknochnack/ReinforcementLearningCourse
- https://www.coursera.org/learn/unsupervised-learning-recommenders-reinforcement-learning
- https://ydata.yandex.com/course/reinforcement-learning
- https://stable-baselines3.readthedocs.io/en/master/
- https://github.com/CityAplons/rl-skoltech-course
- https://mpatacchiola.github.io/blog/