Learning Recursive Bayesian Multinets for Data Clustering by Means of Constructive Induction

Abstract

This paper introduces and evaluates a new class of knowledge model, the recursive Bayesian multinet (RBMN), which encodes the joint probability distribution of a given database. RBMNs extend Bayesian networks (BNs) as well as partitional clustering systems. Briefly, a RBMN is a decision tree with component BNs at the leaves. A RBMN is learnt using a greedy, heuristic approach akin to that used by many supervised decision tree learners, but where BNs are learnt at leaves using constructive induction. A key idea is to treat expected data as real data. This allows us to complete the database and to take advantage of a closed form for the marginal likelihood of the expected complete data that factorizes into separate marginal likelihoods for each family (a node and its parents). Our approach is evaluated on synthetic and real-world databases.

Cite

Text

Peña et al. "Learning Recursive Bayesian Multinets for Data Clustering by Means of Constructive Induction." Machine Learning, 2002. doi:10.1023/A:1013683712412

Markdown

[Peña et al. "Learning Recursive Bayesian Multinets for Data Clustering by Means of Constructive Induction." Machine Learning, 2002.](https://mlanthology.org/mlj/2002/pena2002mlj-learning/) doi:10.1023/A:1013683712412

BibTeX

@article{pena2002mlj-learning,
  title     = {{Learning Recursive Bayesian Multinets for Data Clustering by Means of Constructive Induction}},
  author    = {Peña, José M. and Lozano, José Antonio and Larrañaga, Pedro},
  journal   = {Machine Learning},
  year      = {2002},
  pages     = {63-89},
  doi       = {10.1023/A:1013683712412},
  volume    = {47},
  url       = {https://mlanthology.org/mlj/2002/pena2002mlj-learning/}
}