Bandit

This repository provides simple code and tutorials for bandit algorithms.

What is a bandit algorithm?

The multi-armed bandit (often just "bandit") is a problem in reinforcement learning in which an agent repeatedly chooses one action from a set of actions. Each chosen action yields a reward, and the goal is to maximize the total reward over time. The contextual bandit is a variant in which the agent also observes side information (a context) before each choice.
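To make the choose-then-observe-reward loop concrete, here is a minimal sketch of one common strategy, epsilon-greedy, on simulated Bernoulli arms. It is an illustration only and not necessarily how this repository implements its algorithms; the function name `run_epsilon_greedy`, its parameters, and the arm probabilities are all assumptions made for the example.

```python
import random


def run_epsilon_greedy(true_means, n_rounds=5000, epsilon=0.1, seed=0):
    """Simulate epsilon-greedy on Bernoulli arms.

    Hypothetical helper for illustration; not part of this repository's API.
    true_means[i] is the (hidden) probability that arm i pays a reward of 1.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # number of times each arm was pulled
    estimates = [0.0] * n_arms   # running mean reward observed per arm
    total_reward = 0.0

    for _ in range(n_rounds):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimated reward.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # The environment returns a reward of 1 or 0 for the chosen arm.
        reward = 1.0 if rng.random() < true_means[arm] else 0.0

        # Incremental update of the running mean for that arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return counts, estimates, total_reward


if __name__ == "__main__":
    counts, estimates, total = run_epsilon_greedy([0.2, 0.5, 0.8])
    print("pulls per arm:", counts)
    print("estimated means:", [round(e, 2) for e in estimates])
    print("total reward:", total)
```

With a small `epsilon`, the agent spends most rounds on the arm it currently believes is best while still sampling the others occasionally, which is the exploration/exploitation trade-off at the heart of bandit problems.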

Setup

This repository uses Poetry as a dependency manager. To install the dependencies, run:

$ poetry install

To activate the virtual environment, run:

$ poetry shell

TODO

Add contextual bandit algorithms such as LinUCB, LinThompsonSampling, etc.

