Official code for Continual Learning In Environments With Polynomial Mixing Times
This repository provides the official codebase for the paper "Continual Learning in Environments with Polynomial Mixing Times".
Clone this repository, then change into its directory:
cd polynomial-mixing-times
Create and activate either a Python virtualenv or a conda environment. For example, with virtualenv:
pip install virtualenv
virtualenv -p /usr/bin/python3.7 mixing-times
source mixing-times/bin/activate
To install all the relevant packages use the following command:
pip install -e .
We provide a run script, run_bottleneck.sh, with all the relevant hyperparameters used for both the baselines and our proposed models. Running it runs all the models.
To run the experiments of the proposed models on the Example 2 Bottleneck MDP class with 4 rooms, "random" task evolution, and a random seed of 1, use the following command:
bash run_bottleneck.sh 1 4 "random"
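To repeat the same experiment over several seeds, a small wrapper along the following lines may help. This is only a sketch: the function name sweep_bottleneck and the DRY_RUN variable are hypothetical and not part of this repository, and the argument order (seed, number of rooms, task evolution) is an assumption inferred from the example command above.

```shell
# Hypothetical helper (not part of the repository): sweep run_bottleneck.sh over seeds.
# Assumed argument order of run_bottleneck.sh, inferred from the example command:
#   <seed> <rooms> <task_evolution>
# Usage: sweep_bottleneck <rooms> <task_evolution> <seed> [<seed> ...]
sweep_bottleneck() {
    local rooms="$1" evolution="$2"
    shift 2
    local seed
    for seed in "$@"; do
        if [ "${DRY_RUN:-0}" = "1" ]; then
            # Dry run: print the command instead of executing it.
            printf 'bash run_bottleneck.sh %s %s "%s"\n' "$seed" "$rooms" "$evolution"
        else
            # Stop the sweep at the first failing run.
            bash run_bottleneck.sh "$seed" "$rooms" "$evolution" || return 1
        fi
    done
}
```

After sourcing the snippet, calling DRY_RUN=1 sweep_bottleneck 4 random 1 2 3 prints the three commands without running them, which is a cheap way to check the arguments before launching long runs. Dropping DRY_RUN executes the runs one after another.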
- Online Q-learning
- Q-learning with Replay
- Q-learning with Dyna
- Model-based n-step TD
- Vanilla Policy Gradient
- On-policy rho learning
- Off-policy rho learning
- rho Policy Gradient
ScaleClass-v0
NBottleneckClass-v0
NCycleClass-v0
We used Python 3.7 to run all our experiments.