Rule-based control (RBC) is widely adopted in buildings because of its stability and
robustness. It resembles a behavior-cloning policy refined by human experts; however,
it cannot adapt to distribution drift.
Reinforcement learning (RL) can adapt to such changes, but it must
learn from scratch in the online setting, and in the offline setting its learning ability is limited
by extrapolation errors caused by selecting out-of-distribution actions.
In this paper, we explore how to combine RL with a rule-based control policy,
leveraging the strengths of both to continuously learn a scalable and robust policy in both
online and offline settings.
We start from representative online and offline RL methods, TD3 and TD3+BC,
respectively. We then develop a dynamically weighted actor loss that
selects which policy the RL agent learns from at each training iteration.
Through extensive experiments across various weather conditions, in both deterministic and
stochastic scenarios, we demonstrate that our algorithm,
rule-based incorporated
control regularization (RUBICON), outperforms state-of-the-art
methods in the offline setting.
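
The snippet below is a minimal sketch of what a dynamically weighted actor loss can look like, assuming a TD3+BC-style objective with an added rule-based regularization term. The function name `rubicon_actor_loss`, the callables `actor`, `critic`, and `rbc_policy`, and the weight `w_rbc` are placeholders for illustration only and do not reproduce the exact implementation in this repository.

```python
import torch
import torch.nn.functional as F

def rubicon_actor_loss(actor, critic, rbc_policy, state, dataset_action,
                       w_rbc, alpha=2.5):
    """Sketch of a dynamically weighted actor loss (TD3+BC style).

    `actor`, `critic`, and `rbc_policy` are callables mapping states to
    actions / Q-values; `w_rbc` in [0, 1] is the per-iteration weight that
    shifts regularization between the dataset (behavior-cloning) action
    and the rule-based action.
    """
    pi_action = actor(state)                 # a = pi(s)
    q_value = critic(state, pi_action)       # Q(s, pi(s))

    # TD3+BC-style normalization so the Q term stays on a stable scale.
    lam = alpha / q_value.abs().mean().detach()

    # Regularize toward the dataset action (BC term) and toward the
    # rule-based action (RBC term); the dynamic weight trades them off.
    bc_term = F.mse_loss(pi_action, dataset_action)
    rbc_term = F.mse_loss(pi_action, rbc_policy(state))

    return -lam * q_value.mean() + (1.0 - w_rbc) * bc_term + w_rbc * rbc_term
```

The core design question is how the weight is scheduled at each training iteration; the sketch simply takes it as an input, and the paper describes the actual weighting scheme.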
To run the experiments:

- Install Sinergym.
- Clone our repository: `git clone https://github.com/HYDesmondLiu/RUBICON.git`
- Change into `./RUBICON/01_BRL/` (offline) or `./RUBICON/02_OnlineRL/` (online).
- Modify `Sinergym*.py` to fit your GPU availability (see the sketch after this list).
- Run `python Sinergym_BRL.py` (offline) or `python Sinergym.py` (online).
- The dataset we learned from for the offline approach is available at https://github.com/HYDesmondLiu/B2RL
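
As an example of the GPU-availability edit mentioned above, device selection in a PyTorch script typically looks like the snippet below. This is only a hypothetical illustration; the actual variable names and structure in `Sinergym*.py` may differ.

```python
import torch

# Pick a GPU if one is available, otherwise fall back to the CPU.
# Change "cuda:0" to another index (e.g., "cuda:1") to match your machine.
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

# Models and tensors would then be moved to this device, e.g.:
# actor = actor.to(device)
```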
Please cite our work as:

@inproceedings{liu2023rule,
title={Rule-based policy regularization for reinforcement learning-based building control},
author={Liu, Hsin-Yu and Balaji, Bharathan and Gupta, Rajesh and Hong, Dezhi},
booktitle={Proceedings of the 14th ACM International Conference on Future Energy Systems},
pages={242--265},
year={2023}
}