Decision-Focused Model-Based Reinforcement Learning for Reward Transfer

Abhishek Sharma, Sonali Parbhoo, Omer Gottesman, Finale Doshi-Velez

MLHC 2024

/mlhc/2024/sharma2024mlhc-decisionfocused/

Abstract

Model-based reinforcement learning (MBRL) provides a way to learn a transition model of the environment, which can then be used to plan personalized policies for different patient cohorts, and to understand the dynamics involved in the decision-making process. However, standard MBRL algorithms are either sensitive to changes in the reward function or achieve suboptimal performance on the task when the transition model is restricted. Motivated by the need to use simple and interpretable models in critical domains such as healthcare, we propose a novel robust decision-focused (RDF) algorithm that learns a transition model that achieves high returns while being robust to changes in the reward function. We demonstrate our RDF algorithm can be used with several model classes and planning algorithms. We also provide theoretical and empirical envidence, on variety of simulators and real patient data, that RDF can learn simple yet effective models that can be used to plan personalized policies.

PDF MLHC OpenReview Semantic Scholar

Cite

Text

Sharma et al. "Decision-Focused Model-Based Reinforcement Learning for Reward Transfer." Proceedings of the 9th Machine Learning for Healthcare Conference, 2024.

Markdown

[Sharma et al. "Decision-Focused Model-Based Reinforcement Learning for Reward Transfer." Proceedings of the 9th Machine Learning for Healthcare Conference, 2024.](https://mlanthology.org/mlhc/2024/sharma2024mlhc-decisionfocused/)

BibTeX

@inproceedings{sharma2024mlhc-decisionfocused,
  title     = {{Decision-Focused Model-Based Reinforcement Learning for Reward Transfer}},
  author    = {Sharma, Abhishek and Parbhoo, Sonali and Gottesman, Omer and Doshi-Velez, Finale},
  booktitle = {Proceedings of the 9th Machine Learning for Healthcare Conference},
  year      = {2024},
  volume    = {252},
  url       = {https://mlanthology.org/mlhc/2024/sharma2024mlhc-decisionfocused/}
}