Learning Recursive Bayesian Multinets for Data Clustering by Means of Constructive Induction
Abstract
This paper introduces and evaluates a new class of knowledge model, the recursive Bayesian multinet (RBMN), which encodes the joint probability distribution of a given database. RBMNs extend Bayesian networks (BNs) as well as partitional clustering systems. Briefly, a RBMN is a decision tree with component BNs at the leaves. A RBMN is learnt using a greedy, heuristic approach akin to that used by many supervised decision tree learners, but where BNs are learnt at leaves using constructive induction. A key idea is to treat expected data as real data. This allows us to complete the database and to take advantage of a closed form for the marginal likelihood of the expected complete data that factorizes into separate marginal likelihoods for each family (a node and its parents). Our approach is evaluated on synthetic and real-world databases.
Cite
Text
Peña et al. "Learning Recursive Bayesian Multinets for Data Clustering by Means of Constructive Induction." Machine Learning, 2002. doi:10.1023/A:1013683712412Markdown
[Peña et al. "Learning Recursive Bayesian Multinets for Data Clustering by Means of Constructive Induction." Machine Learning, 2002.](https://mlanthology.org/mlj/2002/pena2002mlj-learning/) doi:10.1023/A:1013683712412BibTeX
@article{pena2002mlj-learning,
title = {{Learning Recursive Bayesian Multinets for Data Clustering by Means of Constructive Induction}},
author = {Peña, José M. and Lozano, José Antonio and Larrañaga, Pedro},
journal = {Machine Learning},
year = {2002},
pages = {63-89},
doi = {10.1023/A:1013683712412},
volume = {47},
url = {https://mlanthology.org/mlj/2002/pena2002mlj-learning/}
}