Learning Optimal Decision Trees Using Caching Branch-and-Bound Search

Gaël Aglin, Siegfried Nijssen, Pierre Schaus

AAAI 2020 pp. 3146-3153

doi:10.1609/AAAI.V34I04.5711 /aaai/2020/aglin2020aaai-learning/

Abstract

Several recent publications have studied the use of Mixed Integer Programming (MIP) for finding an optimal decision tree, that is, the best decision tree under formal requirements on accuracy, fairness or interpretability of the predictive model. These publications used MIP to deal with the hard computational challenge of finding such trees. In this paper, we introduce a new efficient algorithm, DL8.5, for finding optimal decision trees, based on the use of itemset mining techniques. We show that this new approach outperforms earlier approaches with several orders of magnitude, for both numerical and discrete data, and is generic as well. The key idea underlying this new approach is the use of a cache of itemsets in combination with branch-and-bound search; this new type of cache also stores results for parts of the search space that have been traversed partially.

PDF AAAI Semantic Scholar

Cite

Text

Aglin et al. "Learning Optimal Decision Trees Using Caching Branch-and-Bound Search." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I04.5711

Markdown

[Aglin et al. "Learning Optimal Decision Trees Using Caching Branch-and-Bound Search." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/aglin2020aaai-learning/) doi:10.1609/AAAI.V34I04.5711

BibTeX

@inproceedings{aglin2020aaai-learning,
  title     = {{Learning Optimal Decision Trees Using Caching Branch-and-Bound Search}},
  author    = {Aglin, Gaël and Nijssen, Siegfried and Schaus, Pierre},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {3146-3153},
  doi       = {10.1609/AAAI.V34I04.5711},
  url       = {https://mlanthology.org/aaai/2020/aglin2020aaai-learning/}
}