Model Averaging with Bayesian Network Classifiers
Abstract
This paper considers the problem of performing classification by model-averaging over a class of discrete Bayesian network structures consistent with a partial ordering and with bounded in-degree $k .$ We show that for $N$ nodes this class contains in the worst-case at least $\Omega\left(\left(\begin{array}{c}N/2 \\{k}\end{array}\right)^{N / 2} \right)$ distinct network structures, but we show that this summation can be performed in $O\left(\left(\begin{array}{c}N \\{k}\end{array}\right) \cdot N\right)$ time. We use this fact to show that it is possible to efficiently construct a single directed acyclic graph (DAG) whose predictions approximate those of exact model-averaging over this class, allowing approximate model-averaged predictions to be performed in $O(N)$ time. We evaluate the procedure in a supervised classification context, and show empirically that this technique can be beneficial for classification even when the generating distribution is not a member of the class being averaged over, and we characterize the performance over several parameters on simulated and real-world data.
Cite
Text
Dash and Cooper. "Model Averaging with Bayesian Network Classifiers." Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics, 2003.Markdown
[Dash and Cooper. "Model Averaging with Bayesian Network Classifiers." Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics, 2003.](https://mlanthology.org/aistats/2003/dash2003aistats-model/)BibTeX
@inproceedings{dash2003aistats-model,
title = {{Model Averaging with Bayesian Network Classifiers}},
author = {Dash, Denver and Cooper, Gregory F.},
booktitle = {Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics},
year = {2003},
pages = {72-79},
volume = {R4},
url = {https://mlanthology.org/aistats/2003/dash2003aistats-model/}
}