The official code for the paper "Task-Oriented Koopman-Based Control with Contrastive Encoder"
- Install conda and create a conda environment (Python 3.7.13):

  ```
  conda create --name kpmlilat python=3.7.13
  ```
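- Activate the environment before installing dependencies (the name `kpmlilat` comes from the step above):

  ```
  conda activate kpmlilat
  ```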
- Install the necessary dependencies:

  ```
  pip install -r requirements.txt
  ```
- Verify that the dmc2gym simulator works, referring to https://github.com/denisyarats/dmc2gym; a minimal smoke test is sketched after the environment list below.

Simulated environments of DeepMind dm_control:
- CartPole Swingup 4D
- Cheetah Running 18D
- CartPole Swingup Pixel
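A minimal smoke test (a sketch, not a script from this repo) using the `dmc2gym.make` API documented in the repository linked above; the domain/task names follow dm_control conventions for CartPole Swingup:

```python
import dmc2gym

# State-based CartPole Swingup; set from_pixels=True (with height/width)
# to get the pixel-observation variant.
env = dmc2gym.make(domain_name='cartpole', task_name='swingup', seed=1)

obs = env.reset()
for _ in range(10):
    # Random actions suffice to confirm the simulator steps without errors.
    obs, reward, done, info = env.step(env.action_space.sample())
print('observation shape:', obs.shape)
```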
- For training, check `test_one_staged_lqr_pixel.py` and run `run.sh`. Remember to change the CUDA visible devices to your setup, and modify the paths included in the config file (under the `config` folder). A sketch of typical invocations is shown after this list.
- For evaluation, check `test_one_staged_lqr_pixel_only_evaluate.py`.
- All utility functions are in `utils.py`. All plot functions are in `paperplot.py` and `rebuttalplot.py`.
- To use an autoencoder instead of the contrastive encoder, run `test_one_staged_lqr_pixel_AE` (performance is not good; it is included mainly as a comparison).
- The two-staged approach test is in `test_two_staged_lqr.py` (not fully tested; performance is not good, included mainly as a comparison).
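As referenced in the training item above, a minimal sketch of typical training and evaluation invocations; it assumes the scripts take no required command-line arguments (paths live in the config file), and the GPU index here is hypothetical:

```
# Training (adjust the GPU index to your setup).
CUDA_VISIBLE_DEVICES=0 python test_one_staged_lqr_pixel.py

# Evaluation of a trained model.
CUDA_VISIBLE_DEVICES=0 python test_one_staged_lqr_pixel_only_evaluate.py
```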
[1] Laskin, Michael, Aravind Srinivas, and Pieter Abbeel. "CURL: Contrastive Unsupervised Representations for Reinforcement Learning." International Conference on Machine Learning (ICML). PMLR, 2020. (https://github.com/MishaLaskin/curl)

[2] Yin, Hang, Michael C. Welle, and Danica Kragic. "Embedding Koopman Optimal Control in Robot Policy Learning." 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). (https://github.com/navigator8972/koopman_policy)
To cite this work:

```bibtex
@article{lyu2023task,
  title={Task-Oriented Koopman-Based Control with Contrastive Encoder},
  author={Lyu, Xubo and Hu, Hanyang and Siriya, Seth and Pu, Ye and Chen, Mo},
  journal={arXiv preprint arXiv:2309.16077},
  year={2023}
}
```
Please feel free to leave questions on GitHub Issues or contact me via email: lvxubo92 at gmail dot com.