Rating-Based Reinforcement Learning
Abstract
This paper develops a novel rating-based reinforcement learning approach that uses human ratings to obtain human guidance in reinforcement learning. Different from the existing preference-based and ranking-based reinforcement learning paradigms, based on human relative preferences over sample pairs, the proposed rating-based reinforcement learning approach is based on human evaluation of individual trajectories without relative comparisons between sample pairs. The rating-based reinforcement learning approach builds on a new prediction model for human ratings and a novel multi-class loss function. We conduct several experimental studies based on synthetic ratings and real human ratings to evaluate the effectiveness and benefits of the new rating-based reinforcement learning approach.
Cite
Text
White et al. "Rating-Based Reinforcement Learning." ICML 2023 Workshops: MFPL, 2023.Markdown
[White et al. "Rating-Based Reinforcement Learning." ICML 2023 Workshops: MFPL, 2023.](https://mlanthology.org/icmlw/2023/white2023icmlw-ratingbased/)BibTeX
@inproceedings{white2023icmlw-ratingbased,
title = {{Rating-Based Reinforcement Learning}},
author = {White, Devin and Wu, Mingkang and Novoseller, Ellen and Lawhern, Vernon and Waytowich, Nicholas R and Cao, Yongcan},
booktitle = {ICML 2023 Workshops: MFPL},
year = {2023},
url = {https://mlanthology.org/icmlw/2023/white2023icmlw-ratingbased/}
}