Bayes Optimal Classification for Decision Trees

Abstract

We present the first algorithm for exact Bayes optimal classification from the hypothesis space of decision trees satisfying leaf constraints. Our contribution is that we reduce this problem to the problem of finding a rule-based classifier with appropriate weights. We show that these rules and weights can be computed in linear time from the output of a modified frequent itemset mining algorithm, which means that we can compute the classifier in practice, despite the exponential worst-case complexity. We perform experiments in which we compare the Bayes optimal predictions with those of the maximum a posteriori hypothesis.

Cite

Text

Nijssen. "Bayes Optimal Classification for Decision Trees." International Conference on Machine Learning, 2008. doi:10.1145/1390156.1390244

Markdown

[Nijssen. "Bayes Optimal Classification for Decision Trees." International Conference on Machine Learning, 2008.](https://mlanthology.org/icml/2008/nijssen2008icml-bayes/) doi:10.1145/1390156.1390244

BibTeX

@inproceedings{nijssen2008icml-bayes,
  title     = {{Bayes Optimal Classification for Decision Trees}},
  author    = {Nijssen, Siegfried},
  booktitle = {International Conference on Machine Learning},
  year      = {2008},
  pages     = {696-703},
  doi       = {10.1145/1390156.1390244},
  url       = {https://mlanthology.org/icml/2008/nijssen2008icml-bayes/}
}