SEED

This repository contains an implementation of a distributed reinforcement learning agent in which both training and inference are performed on the learner.

Two agents are implemented:

  • IMPALA (V-trace)
  • R2D2

The code is already interfaced with the following environments:

  • Atari
  • DeepMind Lab
  • Google Research Football

However, any reinforcement learning environment using the gym API can be used.
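For reference, the gym API boils down to a reset()/step() interface. The sketch below is a purely illustrative toy environment, not part of this repository; the class name, spaces and reward are made up, and wiring a custom environment into SEED's actors is a separate step.

import gym
import numpy as np
from gym import spaces


class ToyEnv(gym.Env):
  """Trivial environment implementing the gym interface."""

  def __init__(self):
    self.observation_space = spaces.Box(0.0, 1.0, shape=(4,), dtype=np.float32)
    self.action_space = spaces.Discrete(2)
    self._step = 0

  def reset(self):
    self._step = 0
    return self.observation_space.sample()

  def step(self, action):
    self._step += 1
    observation = self.observation_space.sample()
    reward = 1.0 if action == 1 else 0.0  # Dummy reward.
    done = self._step >= 100  # Episodes last at most 100 steps.
    return observation, reward, done, {}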

For a detailed description of the architecture, please read our paper. Please cite it if you use code from this repository in your work.

Bibtex

@article{seed2019,
  author    = {Lasse Espeholt and Rapha{\"e}l Marinier and Piotr Stanczyk and Ke Wang and Marcin Michalski},
  title     = {{SEED RL:} Scalable and Efficient Deep-RL with Accelerated Central Inference},
  year      = {2019},
  url       = {http://arxiv.org/abs/xxx},
  archivePrefix = {arXiv},
}

Prerequisites

There are a few steps you need to take before playing with SEED. The instructions below assume you are running the Ubuntu distribution.

  • Install git:

apt-get install git

  • Clone the SEED git repository:

git clone https://github.com/anonymized/seed_rl.git
cd seed_rl

Local Machine Training on a Single Level

To make it easy to get started with SEED, we provide a way of running it on a local machine. Just run the following command:

./run_local.sh [atari|dmlab|football]

This builds a Docker image from the SEED source code and starts training inside a container based on that image.
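For example, to start Atari training locally:

./run_local.sh atari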

Distributed Training using AI Platform

Note that training with AI Platform incurs charges for the compute resources used.

The first step is to configure GCP and the Cloud project you will use for training:

gcloud auth login
gcloud config set project [YOUR_PROJECT]

Then you just need to execute one of the provided scenarios:

gcp/train_[scenario_name].sh
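For example, an Atari run would use the corresponding scenario script (the exact script names can be found in the gcp/ directory; train_atari.sh below is an assumption based on the naming pattern above):

gcp/train_atari.sh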

This builds the Docker image, pushes it to a repository that AI Platform can access, and starts the training process in the Cloud. Follow the command's output for progress. You can also view running training jobs at https://console.cloud.google.com/ml/jobs

DeepMind Lab Level Cache

By default, the majority of DeepMind Lab's CPU usage comes from generating new scenarios. This cost can be eliminated by enabling the level cache: set the level_cache_dir flag in dmlab/config.py. As there are many unique episodes, it is a good idea to share the same cache across multiple experiments. On AI Platform, you can do this by adding --level_cache_dir=gs://${BUCKET_NAME}/dmlab_cache to the list of parameters passed to the experiment in gcp/submit.sh.
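As a sketch of what that flag might look like, assuming dmlab/config.py defines its flags with absl (the default and help text below are illustrative, not the repository's actual definition):

from absl import flags

# Illustrative only: the level cache flag as an absl-style string flag.
flags.DEFINE_string(
    'level_cache_dir', None,
    'Directory (local path or gs:// bucket) used to cache generated '
    'DeepMind Lab levels, shareable across experiments.')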
