Enhancing HVAC Control Efficiency: A Hybrid Approach Using Imitation and Reinforcement Learning
Abstract
This paper explores the application of imitation learning (IL) and reinforcement learning (RL) in HVAC control. IL learns to perform tasks by imitating a demonstrator, utilising a dataset of demonstrations. However, the performance of IL is highly dependent on the quality of the expert demonstration data. On the other hand, RL can adapt control policies based on different objectives, but for larger problems, it can be sample inefficient, requiring significant time and resources for training. To overcome the limitations of both RL and IL, we propose a combined methodology where IL is used for pre-training and RL for fine-tuning. We introduce a fine-tuning methodology to HVAC control inspired by a robot navigation task. Using the 5-Zone residential building environment provided by Sinergym, we collect state-action pairs from interactions with the environment using a rule-based policy to create a dataset of expert demonstrations. Our experiments show that this combined methodology improves the efficiency and performance of the RL agent by 1% to 11.35% compared to existing literature. This study contributes to the ongoing discourse on how imitation learning can enhance the performance of reinforcement learning in building control systems.
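The pre-training stage described in the abstract (roll out a rule-based policy, record state-action pairs, fit a policy to imitate them) can be illustrated with a minimal behaviour-cloning sketch. This is not the authors' implementation: the single-zone thermal model, the thermostat rule, the 21 °C setpoint, and all function names below are assumptions made for illustration, standing in for the Sinergym 5-Zone environment and the paper's rule-based controller.

```python
import math
import random

SETPOINT = 21.0  # assumed comfort setpoint in deg C (hypothetical)

def rule_based_policy(temp):
    """Hypothetical expert: heat (1) below the setpoint, otherwise idle (0)."""
    return 1 if temp < SETPOINT else 0

def step(temp, action, outdoor=5.0):
    """Toy single-zone thermal model (an assumption, not Sinergym)."""
    # Heat leaks toward the outdoor temperature; the heater adds a fixed gain.
    return temp + 0.1 * (outdoor - temp) + 2.0 * action

def collect_demonstrations(n_steps=500, seed=0):
    """Roll out the rule-based policy and record (state, action) pairs."""
    rng = random.Random(seed)
    temp, data = 18.0, []
    for _ in range(n_steps):
        action = rule_based_policy(temp)
        data.append((temp, action))
        temp = step(temp, action) + rng.uniform(-0.5, 0.5)
    return data

def behaviour_clone(data, epochs=200, lr=0.1):
    """Fit a logistic policy p(heat | temp) by gradient descent on log-loss."""
    w, b = 0.0, 0.0
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for temp, action in data:
            x = temp - SETPOINT  # centre the input for better conditioning
            p = 1.0 / (1.0 + math.exp(-(w * x + b)))
            grad_w += (p - action) * x
            grad_b += p - action
        w -= lr * grad_w / len(data)
        b -= lr * grad_b / len(data)
    return w, b

def cloned_policy(temp, w, b):
    """Greedy action of the cloned policy."""
    return 1 if w * (temp - SETPOINT) + b > 0 else 0

demos = collect_demonstrations()
w, b = behaviour_clone(demos)
```

In a hybrid setup of the kind the abstract describes, parameters such as `w` and `b` would serve as the starting point for an RL algorithm, which then fine-tunes them against the environment's reward rather than training from scratch.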
Cite
Text
Kadamala et al. "Enhancing HVAC Control Efficiency: A Hybrid Approach Using Imitation and Reinforcement Learning." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024. doi:10.1007/978-3-031-70378-2_16
Markdown
[Kadamala et al. "Enhancing HVAC Control Efficiency: A Hybrid Approach Using Imitation and Reinforcement Learning." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024.](https://mlanthology.org/ecmlpkdd/2024/kadamala2024ecmlpkdd-enhancing/) doi:10.1007/978-3-031-70378-2_16
BibTeX
@inproceedings{kadamala2024ecmlpkdd-enhancing,
title = {{Enhancing HVAC Control Efficiency: A Hybrid Approach Using Imitation and Reinforcement Learning}},
author = {Kadamala, Kevlyn and Chambers, Des and Barrett, Enda},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2024},
pages = {256--270},
doi = {10.1007/978-3-031-70378-2_16},
url = {https://mlanthology.org/ecmlpkdd/2024/kadamala2024ecmlpkdd-enhancing/}
}