[NeurIPS 2023 Spotlight] Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning

This repository contains the official source code for "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" [arXiv page] [project page] [OpenReview page], which was accepted as a spotlight presentation at NeurIPS 2023. (Primary contact: Shenzhi Wang)

This codebase includes:

The implementation of FamO2O using JAX IQL, located in the jax_iql folder. For detailed instructions, please see the jax_iql README.
The implementation of FamO2O using JAX CQL, located in the jax_cql folder. For additional information, please refer to the jax_cql README.

We would greatly appreciate it if you could cite our work!

@inproceedings{wang2023train,
title={Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning},
author={Shenzhi Wang and Qisen Yang and Jiawei Gao and Matthieu Gaetan Lin and Hao Chen and Liwei Wu and Ning Jia and Shiji Song and Gao Huang},
booktitle={Thirty-seventh Conference on Neural Information Processing Systems},
year={2023},
url={https://openreview.net/forum?id=vtoY8qJjTR}
}
