Evaluating Meta-Reinforcement Learning Through a HVAC Control Benchmark (Student Abstract)
Abstract
Meta-Reinforcement Learning (RL) algorithms promise to leverage prior task experience to quickly learn new unseen tasks. Unfortunately, evaluating meta-RL algorithms is complicated by a lack of suitable benchmarks. In this paper we propose adapting a challenging real-world heating, ventilation and air-conditioning (HVAC) control benchmark for meta-RL. Unlike existing benchmark problems, HVAC control has a broader task distribution, and sources of exogenous stochasticity from price and weather predictions which can be shared across task definitions. This can enable greater differentiation between the performance of current meta-RL approaches, and open the way for future research into algorithms that can adapt to entirely new tasks not sampled from the current task distribution.
Cite
Text
Grewal et al. "Evaluating Meta-Reinforcement Learning Through a HVAC Control Benchmark (Student Abstract)." AAAI Conference on Artificial Intelligence, 2021. doi:10.1609/AAAI.V35I18.17889Markdown
[Grewal et al. "Evaluating Meta-Reinforcement Learning Through a HVAC Control Benchmark (Student Abstract)." AAAI Conference on Artificial Intelligence, 2021.](https://mlanthology.org/aaai/2021/grewal2021aaai-evaluating/) doi:10.1609/AAAI.V35I18.17889BibTeX
@inproceedings{grewal2021aaai-evaluating,
title = {{Evaluating Meta-Reinforcement Learning Through a HVAC Control Benchmark (Student Abstract)}},
author = {Grewal, Yashvir S. and de Nijs, Frits and Goodwin, Sarah},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2021},
pages = {15785-15786},
doi = {10.1609/AAAI.V35I18.17889},
url = {https://mlanthology.org/aaai/2021/grewal2021aaai-evaluating/}
}