Split Q Learning: Reinforcement Learning with Two-Stream Rewards
Abstract
Drawing an inspiration from behavioral studies of human decision making, we propose here a general parametric framework for a reinforcement learning problem, which extends the standard Q-learning approach to incorporate a two-stream framework of reward processing with biases biologically associated with several neurological and psychiatric conditions, including Parkinson's and Alzheimer's diseases, attention-deficit/hyperactivity disorder (ADHD), addiction, and chronic pain. For AI community, the development of agents that react differently to different types of rewards can enable us to understand a wide spectrum of multi-agent interactions in complex real-world socioeconomic systems. Moreover, from the behavioral modeling perspective, our parametric framework can be viewed as a first step towards a unifying computational model capturing reward processing abnormalities across multiple mental conditions and user preferences in long-term recommendation systems.
Cite
Text
Lin et al. "Split Q Learning: Reinforcement Learning with Two-Stream Rewards." International Joint Conference on Artificial Intelligence, 2019. doi:10.24963/IJCAI.2019/913Markdown
[Lin et al. "Split Q Learning: Reinforcement Learning with Two-Stream Rewards." International Joint Conference on Artificial Intelligence, 2019.](https://mlanthology.org/ijcai/2019/lin2019ijcai-split/) doi:10.24963/IJCAI.2019/913BibTeX
@inproceedings{lin2019ijcai-split,
title = {{Split Q Learning: Reinforcement Learning with Two-Stream Rewards}},
author = {Lin, Baihan and Bouneffouf, Djallel and Cecchi, Guillermo A.},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2019},
pages = {6448-6449},
doi = {10.24963/IJCAI.2019/913},
url = {https://mlanthology.org/ijcai/2019/lin2019ijcai-split/}
}