The Value of Observation for Monitoring Dynamic Systems

Abstract

We consider the fundamental problem of monitoring (i.e. tracking) the belief state in a dynamic system, when the model is only approximately correct and when the initial belief state might be unknown. In this general setting where the model is (perhaps only slightly) mis-specified, monitoring (and consequently planning) may be impossible as errors might accumulate over time. We provide a new characterization, the \emph{value of observation}, which allows us to bound the error accumulation. The value of observation is a parameter that governs how much information the observation provides. For instance, in Partially Observable MDPs when it is 1 the POMDP is an MDP while for an unobservable Markov Decision Process the parameter is 0. Thus, the new parameter characterizes a spectrum from MDPs to unobservable MDPs depending on the amount of information conveyed in the observations.

Cite

Text

Even-Dar et al. "The Value of Observation for Monitoring Dynamic Systems." International Joint Conference on Artificial Intelligence, 2007.

Markdown

[Even-Dar et al. "The Value of Observation for Monitoring Dynamic Systems." International Joint Conference on Artificial Intelligence, 2007.](https://mlanthology.org/ijcai/2007/evendar2007ijcai-value/)

BibTeX

@inproceedings{evendar2007ijcai-value,
  title     = {{The Value of Observation for Monitoring Dynamic Systems}},
  author    = {Even-Dar, Eyal and Kakade, Sham M. and Mansour, Yishay},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2007},
  pages     = {2474-2479},
  url       = {https://mlanthology.org/ijcai/2007/evendar2007ijcai-value/}
}