On Discriminative Parameter Learning of Bayesian Network Classifiers
Abstract
We introduce three discriminative parameter learning algorithms for Bayesian network classifiers, each based on optimizing either the conditional likelihood (CL) or a lower-bound surrogate of the CL. One training procedure is based on the extended Baum-Welch (EBW) algorithm. Similarly, the remaining two approaches iteratively optimize the parameters, initialized to their maximum likelihood (ML) estimates, with a two-step algorithm. In the first step, either the class posterior probabilities or the class assignments are determined from the current parameter estimates. In the second step, the parameters are updated based on these posteriors or class assignments, respectively. We show that one of these algorithms is strongly related to EBW. Additionally, we compare all algorithms to conjugate gradient conditional likelihood (CGCL) parameter optimization [1]. We present classification results for frame- and segment-based phonetic classification and handwritten digit recognition. Discriminative parameter learning shows a significant improvement over generative ML estimation for naive Bayes (NB) and tree augmented naive Bayes (TAN) structures on all data sets. In general, the performance improvement of discriminative parameter learning is large for simple Bayesian network structures that are not optimized for classification.
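As a rough illustration of the quantities named in the abstract, the sketch below fits a discrete naive Bayes classifier by counting, computes the class posteriors under the current parameters (the first of the two steps), and evaluates the conditional log-likelihood that discriminative learning aims to increase. It is not the authors' method: the abstract does not give the update rules, so the second step and EBW are not implemented. The function names, the toy data, and the Laplace smoothing constant `alpha` (added only to avoid taking the log of zero) are all assumptions made for this example.

```python
import math


def fit_nb_counts(X, y, n_values, alpha=1.0):
    """Count-based estimates for a discrete naive Bayes classifier.

    alpha is a hypothetical Laplace smoothing constant; with alpha=0 these
    are the plain ML estimates, which can contain zeros.
    """
    classes = sorted(set(y))
    n = len(y)
    prior = {
        c: (sum(1 for t in y if t == c) + alpha) / (n + alpha * len(classes))
        for c in classes
    }
    cond = {}
    for c in classes:
        rows = [x for x, t in zip(X, y) if t == c]
        cond[c] = []
        for j, k in enumerate(n_values):
            counts = [alpha] * k
            for x in rows:
                counts[x[j]] += 1
            total = sum(counts)
            cond[c].append([v / total for v in counts])
    return prior, cond


def class_posteriors(x, prior, cond):
    """P(c | x) under the current parameters, computed in log space."""
    log_joint = {
        c: math.log(prior[c])
        + sum(math.log(cond[c][j][v]) for j, v in enumerate(x))
        for c in prior
    }
    m = max(log_joint.values())
    log_z = m + math.log(sum(math.exp(v - m) for v in log_joint.values()))
    return {c: math.exp(v - log_z) for c, v in log_joint.items()}


def conditional_log_likelihood(X, y, prior, cond):
    """Sum over samples of log P(true class | features)."""
    return sum(
        math.log(class_posteriors(x, prior, cond)[t]) for x, t in zip(X, y)
    )


# Toy data (made up): two binary features, two classes.
X = [(0, 1), (0, 0), (1, 1), (1, 0), (0, 1), (1, 1)]
y = [0, 0, 1, 1, 0, 1]

prior, cond = fit_nb_counts(X, y, n_values=[2, 2])
post = class_posteriors((0, 1), prior, cond)
cll = conditional_log_likelihood(X, y, prior, cond)
print(post, cll)
```

A discriminative learner would start from `prior` and `cond` and adjust them so that `cll` increases, whereas generative ML estimation stops after the counting step.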
Cite
Text
Pernkopf and Wohlmayr. "On Discriminative Parameter Learning of Bayesian Network Classifiers." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2009. doi:10.1007/978-3-642-04174-7_15
Markdown
[Pernkopf and Wohlmayr. "On Discriminative Parameter Learning of Bayesian Network Classifiers." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2009.](https://mlanthology.org/ecmlpkdd/2009/pernkopf2009ecmlpkdd-discriminative/) doi:10.1007/978-3-642-04174-7_15
BibTeX
@inproceedings{pernkopf2009ecmlpkdd-discriminative,
title = {{On Discriminative Parameter Learning of Bayesian Network Classifiers}},
author = {Pernkopf, Franz and Wohlmayr, Michael},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2009},
pages = {221-237},
doi = {10.1007/978-3-642-04174-7_15},
url = {https://mlanthology.org/ecmlpkdd/2009/pernkopf2009ecmlpkdd-discriminative/}
}