Reinforcement Learning

This is an implementation of code for a reinforcement learning course.

Multi-armed Bandits

This repository implements a set of algorithms to solve the multi-armed bandit problem:

Furthermore, we implemented 2 sample bandit interfaces as examples of how the algorithms (agent) can interact with bandits (environment).