Learning to Search Efficiently for Causally Near-Optimal Treatments

Abstract

Finding an effective medical treatment often requires a search by trial and error. Making this search more efficient by minimizing the number of unnecessary trials could lower both costs and patient suffering. We formalize this problem as learning a policy for finding a near-optimal treatment in a minimum number of trials using a causal inference framework. We give a model-based dynamic programming algorithm which learns from observational data while being robust to unmeasured confounding. To reduce time complexity, we suggest a greedy algorithm which bounds the near-optimality constraint. The methods are evaluated on synthetic and real-world healthcare data and compared to model-free reinforcement learning. We find that our methods compare favorably to the model-free baseline while offering a more transparent trade-off between search time and treatment efficacy.

Cite

Text

Håkansson et al. "Learning to Search Efficiently for Causally Near-Optimal Treatments." Neural Information Processing Systems, 2020.

Markdown

[Håkansson et al. "Learning to Search Efficiently for Causally Near-Optimal Treatments." Neural Information Processing Systems, 2020.](https://mlanthology.org/neurips/2020/hakansson2020neurips-learning/)

BibTeX

@inproceedings{hakansson2020neurips-learning,
  title     = {{Learning to Search Efficiently for Causally Near-Optimal Treatments}},
  author    = {Håkansson, Samuel and Lindblom, Viktor and Gottesman, Omer and Johansson, Fredrik D},
  booktitle = {Neural Information Processing Systems},
  year      = {2020},
  url       = {https://mlanthology.org/neurips/2020/hakansson2020neurips-learning/}
}