Submodular Surrogates for Value of Information

Abstract

How should we gather information to make effective decisions? A classical answer to this fundamental problem is given by the decision-theoretic value of information. Unfortunately, optimizing this objective is intractable, and myopic (greedy) approximations are known to perform poorly. In this paper, we introduce DiRECt, an efficient yet near-optimal algorithm for nonmyopically optimizing value of information. Crucially, DiRECt uses a novel surrogate objective that is: (1) aligned with the value of information problem (2) efficient to evaluate and (3) adaptive submodular. This latter property enables us to utilize an efficient greedy optimization while providing strong approximation guarantees. We demonstrate the utility of our approach on four diverse case-studies: touch-based robotic localization, comparison-based preference learning, wild-life conservation management, and preference elicitation in behavioral economics. In the first application, we demonstrate DiRECt in closed-loop on an actual robotic platform.

Cite

Text

Chen et al. "Submodular Surrogates for Value of Information." AAAI Conference on Artificial Intelligence, 2015. doi:10.1609/AAAI.V29I1.9694

Markdown

[Chen et al. "Submodular Surrogates for Value of Information." AAAI Conference on Artificial Intelligence, 2015.](https://mlanthology.org/aaai/2015/chen2015aaai-submodular/) doi:10.1609/AAAI.V29I1.9694

BibTeX

@inproceedings{chen2015aaai-submodular,
  title     = {{Submodular Surrogates for Value of Information}},
  author    = {Chen, Yuxin and Javdani, Shervin and Karbasi, Amin and Bagnell, J. Andrew and Srinivasa, Siddhartha S. and Krause, Andreas},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2015},
  pages     = {3511-3518},
  doi       = {10.1609/AAAI.V29I1.9694},
  url       = {https://mlanthology.org/aaai/2015/chen2015aaai-submodular/}
}