Robust Learning with Missing Data

Abstract

This paper introduces a new method, called the robust Bayesian estimator (RBE), to learn conditional probability distributions from incomplete data sets. The intuition behind the RBE is that, when no information about the pattern of missing data is available, an incomplete database constrains the set of all possible estimates and this paper provides a characterization of these constraints. An experimental comparison with two popular methods to estimate conditional probability distributions from incomplete data—Gibbs sampling and the EM algorithm—shows a gain in robustness. An application of the RBE to quantify a naive Bayesian classifier from an incomplete data set illustrates its practical relevance.

Cite

Text

Ramoni and Sebastiani. "Robust Learning with Missing Data." Machine Learning, 2001. doi:10.1023/A:1010968702992

Markdown

[Ramoni and Sebastiani. "Robust Learning with Missing Data." Machine Learning, 2001.](https://mlanthology.org/mlj/2001/ramoni2001mlj-robust/) doi:10.1023/A:1010968702992

BibTeX

@article{ramoni2001mlj-robust,
  title     = {{Robust Learning with Missing Data}},
  author    = {Ramoni, Marco and Sebastiani, Paola},
  journal   = {Machine Learning},
  year      = {2001},
  pages     = {147-170},
  doi       = {10.1023/A:1010968702992},
  volume    = {45},
  url       = {https://mlanthology.org/mlj/2001/ramoni2001mlj-robust/}
}