Skip to content

This project focuses on using Q-learning, a temporal difference learning approach to solving an OpenAI-Gym task.

License

Notifications You must be signed in to change notification settings

VirajVaitha123/Q-learning-to-solve-OpenAI-Gyms-Taxi-v2

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Q-learning-to-solve-OpenAI-Gyms-Taxi-v3

Test

Introduction

This project focuses on using Q-learning, a temporal difference learning approach to solving an OpenAI-Gyms Taxi-v2 task.

The monte carlo control method was not sufficient to finding an optimal solution for this game. TD learning enabled the agent to learn after each timestep, instead of waiting to play an entire game (episode). There are many variations to the TD Learning Sarsa algorithms, and this particular projet uses Sarsamax (a.k.a Q Learning).

Game Description

" This task was introduced in [Dietterich2000] to illustrate some issues in hierarchical reinforcement learning. There are 4 locations (labeled by different letters) and your job is to pick up the passenger at one location and drop him off in another. You receive +20 points for a successful dropoff, and lose 1 point for every timestep it takes. There is also a 10 point penalty for illegal pick-up and drop-off actions. "

Please see https://gym.openai.com/envs/Taxi-v2/ for more information.

Instructions

Step 1: clone the repo

git clone https://github.com/VirajVaitha123/Q-learning-to-solve-OpenAI-Gyms-Taxi-v2.git

Step 2: cd to the working directory

cd filepath_to_directory 

Step 3: Install dependecies

pip install -r requirements.txt 

Step 4: Run the main python script

python main.py 

TO DO: Create Visualisations TO DO: Play game using agent! TO DO: GIFs or Images to see the game whilst playing?

About

This project focuses on using Q-learning, a temporal difference learning approach to solving an OpenAI-Gym task.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages