Discovering and Achieving Goals via World Models

[Project Website] [Benchmark Code] [Video (2min)] [Oral Talk (13min)] [Paper]

Russell Mendonca*¹, Oleh Rybkin*², Kostas Daniilidis², Danijar Hafner^3,4, Deepak Pathak¹
(* equal contribution, random order)

¹Carnegie Mellon University
²University of Pennsylvania
³Google Research, Brain Team
⁴University of Toronto

Official implementation of the Lexa agent from the paper Discovering and Achieving Goals via World Models.

Setup

Create the conda environment by running :

conda env create -f environment.yml

Clone the lexa-benchmark repo, and modify the python path
export PYTHONPATH=<path to lexa-training>/lexa:<path to lexa-benchmark>

Export the following variables for rendering
export MUJOCO_RENDERER=egl; export MUJOCO_GL=egl

WARNING! Make sure to use the right python and mujoco version. The robobin environment code is known to break with other versions. Other environments might or might not work.

Training

First source the environment : source activate lexa

For training, run :

export CUDA_VISIBLE_DEVICES=<gpu_id>  
python train.py --configs defaults <method> --task <task> --logdir <log path> --time_limit <time limit>

where method can be lexa_temporal, lexa_cosine, ddl, diayn or gcsl
Supported tasks are dmc_walker_walk, dmc_quadruped_run, robobin, kitchen, joint. The time limit should be 1000 for DMC and default otherwise.

To view the graphs and gifs during training, run tensorboard --logdir <log path>

Bibtex

If you find this code useful, please cite:

@misc{lexa2021,
    title={Discovering and Achieving Goals via World Models},
    author={Mendonca, Russell and Rybkin, Oleh and
    Daniilidis, Kostas and Hafner, Danijar and Pathak, Deepak},
    year={2021},
    Booktitle={NeurIPS}
}

Acknowledgements

This code was developed using Dreamer V2 and Plan2Explore.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Discovering and Achieving Goals via World Models

[Project Website] [Benchmark Code] [Video (2min)] [Oral Talk (13min)] [Paper]

Setup

Training

Bibtex

Acknowledgements

Files

README.md

Latest commit

History

README.md

File metadata and controls

Discovering and Achieving Goals via World Models

[Project Website] [Benchmark Code] [Video (2min)] [Oral Talk (13min)] [Paper]

Setup

Training

Bibtex

Acknowledgements