Beyond Information Gain: An Empirical Benchmark for Low-Switching-Cost Reinforcement Learning
Abstract
A ubiquitous requirement in many practical reinforcement learning (RL) applications is that the deployed policy that actually interacts with the environment cannot change frequently. This setting is called low-switching-cost RL: the goal is to achieve the highest reward while reducing the number of policy switches during training. A recent trend in theoretical RL research is to develop provably efficient RL algorithms with low switching cost. The core idea in these theoretical works is to measure the information gain and switch the policy when the information gain has doubled. Despite the theoretical advances, none of the existing approaches has been validated empirically. We conduct the first empirical evaluation of different policy switching criteria on popular RL testbeds, including a medical treatment environment, the Atari games, and robotic control tasks. Surprisingly, although information-gain-based methods do recover the optimal rewards, they often lead to a substantially higher switching cost. By contrast, we find that a feature-based criterion, which has been largely ignored in the theoretical research, consistently produces the best performance across all domains. We hope our benchmark brings insights to the community and inspires future research. Our code and complete results can be found at https://sites.google.com/view/low-switching-cost-rl
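The doubling rule mentioned in the abstract can be illustrated with a minimal sketch. It assumes a linear setting in which information gain is tracked through the log-determinant of a regularized feature covariance matrix, as is common in the theoretical literature. The class name `DoublingSwitchCriterion`, the `reg` parameter, and the feature vectors are hypothetical and are not taken from the authors' code.

```python
import numpy as np


class DoublingSwitchCriterion:
    """Hypothetical helper: signal a policy switch when det(Lambda) has doubled.

    Assumption: information gain is measured by log det of the regularized
    feature covariance matrix Lambda = reg * I + sum_t phi_t phi_t^T.
    """

    def __init__(self, feature_dim, reg=1.0):
        # Regularized feature covariance matrix, initialised to reg * I.
        self.cov = reg * np.eye(feature_dim)
        # log det(Lambda) recorded at the most recent policy switch.
        self.logdet_at_switch = self._logdet()
        self.num_switches = 0

    def _logdet(self):
        # slogdet is numerically safer than log(det(...)) for large matrices.
        _, logdet = np.linalg.slogdet(self.cov)
        return logdet

    def update(self, phi):
        """Add one feature vector; return True if the policy should switch."""
        phi = np.asarray(phi, dtype=float)
        self.cov += np.outer(phi, phi)
        logdet = self._logdet()
        # det has at least doubled <=> log det grew by at least log 2.
        if logdet - self.logdet_at_switch >= np.log(2.0):
            self.logdet_at_switch = logdet
            self.num_switches += 1
            return True
        return False
```

In a training loop, `update` would be called once per observed transition, and the deployed policy would be synchronised with the online policy only when it returns `True`. Because the determinant must double between switches, the number of switches under this rule grows slowly relative to the number of samples.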
Cite
Text
Xu et al. "Beyond Information Gain: An Empirical Benchmark for Low-Switching-Cost Reinforcement Learning." Transactions on Machine Learning Research, 2023.
Markdown
[Xu et al. "Beyond Information Gain: An Empirical Benchmark for Low-Switching-Cost Reinforcement Learning." Transactions on Machine Learning Research, 2023.](https://mlanthology.org/tmlr/2023/xu2023tmlr-beyond/)
BibTeX
@article{xu2023tmlr-beyond,
title = {{Beyond Information Gain: An Empirical Benchmark for Low-Switching-Cost Reinforcement Learning}},
author = {Xu, Shusheng and Liang, Yancheng and Li, Yunfei and Du, Simon Shaolei and Wu, Yi},
journal = {Transactions on Machine Learning Research},
year = {2023},
url = {https://mlanthology.org/tmlr/2023/xu2023tmlr-beyond/}
}