Alpha Zero Boosted

A "build to learn" implementation of the Alpha Zero algorithm written in Python that uses LightGBM (Gradient Boosted Decision Trees) in place of a Deep Neural Network for value/policy functions.

A few environments (i.e., games) are implemented: Quoridor, Connect Four, and Tic-Tac-Toe.

Running

Play a game

python3.7 play.py <environment> <species-generation> <species-generation> <time-to-move>

python3.7 play.py connect_four gbdt-1 human-1 5.0

Train a bot

python3.7 train_bot.py <environment> <species> <num batches>

python3.7 train_bot.py connect_four gbdt 10

Setup

Install

If you haven't already, install pyenv/pyenv-virtualenv (see Install pyenv/pyenv-virtualenv below)

Clone repo:

git clone git@github.com:cgreer/alpha-zero-boosted.git

Create a virtual environment for the project:

cd alpha-zero-boosted
pyenv install 3.7.7
pyenv virtualenv 3.7.7 alpha_boosted_env
pyenv local alpha_boosted_env

Install packages:

pip install -r requirements.txt

MacOS Issue: Because some wheels don't appear to be built properly, you may need to first install libomp and then retry installing packages:

brew install libomp

# Then try installing again
pip install -r requirements.txt

Install pyenv/pyenv-virtualenv

pyenv

Install the plugin:

brew install pyenv

pyenv-virtualenv plugin

Note: These instructions are copied here for convenience. Check pyenv-virtualenv to ensure they are up to date.

Install the plugin:

brew install pyenv-virtualenv

Add the following two lines to your profile file (~/.zprofile if using zsh, ~/.bash_profile if bash):

eval "$(pyenv init -)"
eval "$(pyenv virtualenv-init -)"

Restart terminal (so profile commands above execute).

Name		Name	Last commit message	Last commit date
Latest commit History 131 Commits
images		images
.gitignore		.gitignore
README.md		README.md
agent.py		agent.py
agent_replay.py		agent_replay.py
assess.py		assess.py
batch_info.py		batch_info.py
bootstrap.py		bootstrap.py
code_timing.py		code_timing.py
connect_four.py		connect_four.py
diversity.py		diversity.py
environment.py		environment.py
environment_registry.py		environment_registry.py
evaluation.py		evaluation.py
figures.py		figures.py
gbdt.py		gbdt.py
gbdt_model.py		gbdt_model.py
generation_info.py		generation_info.py
helpers.py		helpers.py
human_agent.py		human_agent.py
intuition_model.py		intuition_model.py
leftright.py		leftright.py
mcts_agent.py		mcts_agent.py
noise_maker.py		noise_maker.py
paths.py		paths.py
play.py		play.py
quoridor.py		quoridor.py
random_agent.py		random_agent.py
replay.py		replay.py
requirements.txt		requirements.txt
sample_lgbm.json		sample_lgbm.json
self_play.py		self_play.py
settings.py		settings.py
species.py		species.py
stats.py		stats.py
surprise.py		surprise.py
system_monitoring.py		system_monitoring.py
table_operations.py		table_operations.py
text.py		text.py
tictactoe.py		tictactoe.py
train.py		train.py
train_bot.py		train_bot.py
training_info.py		training_info.py
training_samples.py		training_samples.py
treelite_model.py		treelite_model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Alpha Zero Boosted

Running

Play a game

Train a bot

Setup

Install

Install pyenv/pyenv-virtualenv

pyenv

pyenv-virtualenv plugin

About

Releases

Packages

Languages

tricao/alpha-zero-boosted

Folders and files

Latest commit

History

Repository files navigation

Alpha Zero Boosted

Running

Play a game

Train a bot

Setup

Install

Install pyenv/pyenv-virtualenv

pyenv

pyenv-virtualenv plugin

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages