Differentiable TAN Structure Learning for Bayesian Network Classifiers

Abstract

Learning the structure of Bayesian networks is a difficult combinatorial optimization problem. In this paper, we consider learning of tree-augmented naive Bayes (TAN) structures for Bayesian network classifiers with discrete input features. Instead of performing a combinatorial optimization over the space of possible graph structures, the proposed method learns a distribution over graph structures. After training, we select the most probable structure of this distribution. This allows for a joint training of the Bayesian network parameters along with its TAN structure using gradient-based optimization. The proposed method is agnostic to the specific loss and only requires that it is differentiable. We perform extensive experiments using a hybrid generative-discriminative loss based on the discriminative probabilistic margin. Our method consistently outperforms random TAN structures and Chow-Liu TAN structures.

Cite

Text

Roth and Pernkopf. "Differentiable TAN Structure Learning for Bayesian Network Classifiers." Proceedings of pgm 2020, 2020.

Markdown

[Roth and Pernkopf. "Differentiable TAN Structure Learning for Bayesian Network Classifiers." Proceedings of pgm 2020, 2020.](https://mlanthology.org/pgm/2020/roth2020pgm-differentiable/)

BibTeX

@inproceedings{roth2020pgm-differentiable,
  title     = {{Differentiable TAN Structure Learning for Bayesian Network Classifiers}},
  author    = {Roth, Wolfgang and Pernkopf, Franz},
  booktitle = {Proceedings of pgm 2020},
  year      = {2020},
  pages     = {389-400},
  volume    = {138},
  url       = {https://mlanthology.org/pgm/2020/roth2020pgm-differentiable/}
}