On the Challenges of Using Reinforcement Learning in Precision Drug Dosing: Delay and Prolongedness of Action Effects
Abstract
Drug dosing is an important application of AI, which can be formulated as a Reinforcement Learning (RL) problem. In this paper, we identify two major challenges of using RL for drug dosing: delayed and prolonged effects of administering medications, which break the Markov assumption of the RL framework. We focus on prolongedness and define PAE-POMDP (Prolonged Action Effect-Partially Observable Markov Decision Process), a subclass of POMDPs in which the Markov assumption does not hold specifically due to prolonged effects of actions. Motivated by the pharmacology literature, we propose a simple and effective approach to converting drug dosing PAE-POMDPs into MDPs, enabling the use of the existing RL algorithms to solve such problems. We validate the proposed approach on a toy task, and a challenging glucose control task, for which we devise a clinically-inspired reward function. Our results demonstrate that: (1) the proposed method to restore the Markov assumption leads to significant improvements over a vanilla baseline; (2) the approach is competitive with recurrent policies which may inherently capture the prolonged affect of actions; (3) it is remarkably more time and memory efficient than the recurrent baseline and hence more suitable for real-time dosing control systems; and (4) it exhibits favourable qualitative behavior in our policy analysis.
Cite
Text
Basu et al. "On the Challenges of Using Reinforcement Learning in Precision Drug Dosing: Delay and Prolongedness of Action Effects." AAAI Conference on Artificial Intelligence, 2023. doi:10.1609/AAAI.V37I12.26650Markdown
[Basu et al. "On the Challenges of Using Reinforcement Learning in Precision Drug Dosing: Delay and Prolongedness of Action Effects." AAAI Conference on Artificial Intelligence, 2023.](https://mlanthology.org/aaai/2023/basu2023aaai-challenges/) doi:10.1609/AAAI.V37I12.26650BibTeX
@inproceedings{basu2023aaai-challenges,
title = {{On the Challenges of Using Reinforcement Learning in Precision Drug Dosing: Delay and Prolongedness of Action Effects}},
author = {Basu, Sumana and Legault, Marc-André and Romero-Soriano, Adriana and Precup, Doina},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2023},
pages = {14102-14109},
doi = {10.1609/AAAI.V37I12.26650},
url = {https://mlanthology.org/aaai/2023/basu2023aaai-challenges/}
}