On Predictive Classification of Binary Vectors

Abstract

The problem of rational classification of a database of binary vectors is analyzed by means of a family of Bayesian predictive distributions on the binary hypercube. The general notion of predictive classification was probably first discussed by S. Geisser. The predictive distributions are expressed in terms of a finite number observables based on a given set of binary vectors (predictors or centroids) representing a system of classes and an entropy-maximizing family of probability distributions. We derive the (non-probabilistic) criterion of maximal predictive classification due to J . Gower (1974) as a special case of a Bayesian predictive classification. The notion of a predictive distribution will be related to stochastic complexity of a set of data with respect to a family of statistical distributions. An application to bacterial identification will be presented using a database of Enterobacteriaceae as in Gyllenberg (1996 c). A framework for the analysis is provided by a theorem about the merging of opinions due to Blackwell and Dubins (1962). We prove certain results about the asymptotic properties of the predictive learning process.

Cite

Text

Gyllenberg and Koski. "On Predictive Classification of Binary Vectors." Proceedings of the Sixth International Workshop on Artificial Intelligence and Statistics, 1997.

Markdown

[Gyllenberg and Koski. "On Predictive Classification of Binary Vectors." Proceedings of the Sixth International Workshop on Artificial Intelligence and Statistics, 1997.](https://mlanthology.org/aistats/1997/gyllenberg1997aistats-predictive/)

BibTeX

@inproceedings{gyllenberg1997aistats-predictive,
  title     = {{On Predictive Classification of Binary Vectors}},
  author    = {Gyllenberg, Mats and Koski, Timo},
  booktitle = {Proceedings of the Sixth International Workshop on Artificial Intelligence and Statistics},
  year      = {1997},
  pages     = {239-242},
  volume    = {R1},
  url       = {https://mlanthology.org/aistats/1997/gyllenberg1997aistats-predictive/}
}