Evaluating Meta-Reinforcement Learning Through a HVAC Control Benchmark (Student Abstract)

Yashvir S. Grewal, Frits de Nijs, Sarah Goodwin

AAAI 2021 pp. 15785-15786

doi:10.1609/AAAI.V35I18.17889

Abstract

Meta-reinforcement learning (meta-RL) algorithms promise to leverage prior task experience to quickly learn new, unseen tasks. Unfortunately, evaluating meta-RL algorithms is complicated by a lack of suitable benchmarks. In this paper, we propose adapting a challenging real-world heating, ventilation, and air-conditioning (HVAC) control benchmark for meta-RL. Unlike existing benchmark problems, HVAC control has a broader task distribution and sources of exogenous stochasticity from price and weather predictions that can be shared across task definitions. This can enable greater differentiation between the performance of current meta-RL approaches and open the way for future research into algorithms that can adapt to entirely new tasks not sampled from the current task distribution.

Cite

Text

Grewal et al. "Evaluating Meta-Reinforcement Learning Through a HVAC Control Benchmark (Student Abstract)." AAAI Conference on Artificial Intelligence, 2021. doi:10.1609/AAAI.V35I18.17889

Markdown

[Grewal et al. "Evaluating Meta-Reinforcement Learning Through a HVAC Control Benchmark (Student Abstract)." AAAI Conference on Artificial Intelligence, 2021.](https://mlanthology.org/aaai/2021/grewal2021aaai-evaluating/) doi:10.1609/AAAI.V35I18.17889

BibTeX

@inproceedings{grewal2021aaai-evaluating,
  title     = {{Evaluating Meta-Reinforcement Learning Through a HVAC Control Benchmark (Student Abstract)}},
  author    = {Grewal, Yashvir S. and de Nijs, Frits and Goodwin, Sarah},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {15785--15786},
  doi       = {10.1609/AAAI.V35I18.17889},
  url       = {https://mlanthology.org/aaai/2021/grewal2021aaai-evaluating/}
}