On Discriminative Bayesian Network Classifiers and Logistic Regression
Abstract
Discriminative learning of the parameters in the naive Bayes model is known to be equivalent to a logistic regression problem. Here we show that the same fact holds for much more general Bayesian network models, as long as the corresponding network structure satisfies a certain graph-theoretic property. The property holds for naive Bayes but also for more complex structures such as tree-augmented naive Bayes (TAN) as well as for mixed diagnostic-discriminative structures. Our results imply that for networks satisfying our property, the conditional likelihood cannot have local maxima so that the global maximum can be found by simple local optimization methods. We also show that if this property does not hold, then in general the conditional likelihood can have local, non-global maxima. We illustrate our theoretical results by empirical experiments with local optimization in a conditional naive Bayes model. Furthermore, we provide a heuristic strategy for pruning the number of parameters and relevant features in such models. For many data sets, we obtain good results with heavily pruned submodels containing many fewer parameters than the original naive Bayes model.
Cite
Text
Roos et al. "On Discriminative Bayesian Network Classifiers and Logistic Regression." Machine Learning, 2005. doi:10.1007/S10994-005-0471-6Markdown
[Roos et al. "On Discriminative Bayesian Network Classifiers and Logistic Regression." Machine Learning, 2005.](https://mlanthology.org/mlj/2005/roos2005mlj-discriminative/) doi:10.1007/S10994-005-0471-6BibTeX
@article{roos2005mlj-discriminative,
title = {{On Discriminative Bayesian Network Classifiers and Logistic Regression}},
author = {Roos, Teemu and Wettig, Hannes and Grünwald, Peter and Myllymäki, Petri and Tirri, Henry},
journal = {Machine Learning},
year = {2005},
pages = {267-296},
doi = {10.1007/S10994-005-0471-6},
volume = {59},
url = {https://mlanthology.org/mlj/2005/roos2005mlj-discriminative/}
}