rl-learn

This is the code for our IJCAI 2019 paper Using Natural Language for Reward Shaping in Reinforcement Learning.

Running the code:

Clone this repository and install dependencies using the included requirements.txt file. The code requires Python 3.
Download preprocessed data:

wget http://www.cs.utexas.edu/~ml/pgoyal/ijcai19/train_lang_data.pkl -O ./data/train_lang_data.pkl
wget http://www.cs.utexas.edu/~ml/pgoyal/ijcai19/test_lang_data.pkl -O ./data/test_lang_data.pkl

Run the LEARN module training and RL training using the following commands:

mkdir learn_model
python learn/train.py --lang_enc=onehot --save_path=./learn_model
python rl/main.py --expt_id=<expt_id> --descr_id=<descr_id> --lang_coeff=1.0 --lang_enc=onehot --model_dir=./learn_model

Data

Raw data can be downloaded from http://www.cs.utexas.edu/~ml/pgoyal/ijcai19/atari-lang.zip. The directories contain frames from Montezuma's revenge (downloaded from Atari Grand Challenge dataset). The file annotations.txt contains pairs of clip ids and natural language descriptions. The clip id is formatted as <directory_name>/<start_frame>-<end_frame>.mp4

Preprocessed data can be generated from the raw data as follows:

Download the InferSent model using the following command:

wget http://www.cs.utexas.edu/~ml/pgoyal/ijcai19/infersent1.pkl -O ./lang_enc_pretrained/InferSent/encoder/infersent1.pkl

Download pretrained GloVe vectors (glove.6B.zip) from https://nlp.stanford.edu/projects/glove/. Put the unzipped files in lang_enc_pretrained/glove.
Run the preprocessing code as follows:

python scripts/preprocess_data.py

This will create files train_lang_data.pkl and test_lang_data.pkl in the ./data directory.

Acknowledgements:

The RL code is adapted from the following implementation -- https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

rl-learn

Running the code:

Data

Acknowledgements:

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
data		data
lang_enc_pretrained/InferSent		lang_enc_pretrained/InferSent
learn		learn
rl		rl
scripts		scripts
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

License

prasoongoyal/rl-learn

Folders and files

Latest commit

History

Repository files navigation

rl-learn

Running the code:

Data

Acknowledgements:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages