Split Q Learning: Reinforcement Learning with Two-Stream Rewards

Abstract

Drawing an inspiration from behavioral studies of human decision making, we propose here a general parametric framework for a reinforcement learning problem, which extends the standard Q-learning approach to incorporate a two-stream framework of reward processing with biases biologically associated with several neurological and psychiatric conditions, including Parkinson's and Alzheimer's diseases, attention-deficit/hyperactivity disorder (ADHD), addiction, and chronic pain. For AI community, the development of agents that react differently to different types of rewards can enable us to understand a wide spectrum of multi-agent interactions in complex real-world socioeconomic systems. Moreover, from the behavioral modeling perspective, our parametric framework can be viewed as a first step towards a unifying computational model capturing reward processing abnormalities across multiple mental conditions and user preferences in long-term recommendation systems.

Cite

Text

Lin et al. "Split Q Learning: Reinforcement Learning with Two-Stream Rewards." International Joint Conference on Artificial Intelligence, 2019. doi:10.24963/IJCAI.2019/913

Markdown

[Lin et al. "Split Q Learning: Reinforcement Learning with Two-Stream Rewards." International Joint Conference on Artificial Intelligence, 2019.](https://mlanthology.org/ijcai/2019/lin2019ijcai-split/) doi:10.24963/IJCAI.2019/913

BibTeX

@inproceedings{lin2019ijcai-split,
  title     = {{Split Q Learning: Reinforcement Learning with Two-Stream Rewards}},
  author    = {Lin, Baihan and Bouneffouf, Djallel and Cecchi, Guillermo A.},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2019},
  pages     = {6448-6449},
  doi       = {10.24963/IJCAI.2019/913},
  url       = {https://mlanthology.org/ijcai/2019/lin2019ijcai-split/}
}