Explainable Models via Compression of Tree Ensembles

Abstract

Ensemble models (bagging and gradient boosting) of relational decision trees have proved to be some of the most effective learning methods in the area of probabilistic logic models (PLMs). While effective, they lose one of the most important benefits of PLMs: interpretability. In this paper, we consider the problem of compressing a large set of learned trees into a single explainable model. To this end, we propose CoTE (Compression of Tree Ensembles), which produces a single small decision list as a compressed representation. CoTE first converts the trees to decision lists and then performs the combination and compression with the aid of the original training set. An experimental evaluation demonstrates the effectiveness of CoTE on several benchmark relational data sets.
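The abstract sketches a two-step pipeline: convert each tree to a decision list, then combine and compress the lists using the training data. As an illustration of the first step only, here is a minimal, hypothetical Python sketch that flattens a binary decision tree into an ordered list of if-then rules; the `Node` class, `to_decision_list` function, and the toy tree are assumptions for illustration, not the paper's implementation (which operates on relational first-order trees).

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

@dataclass
class Node:
    """A binary decision-tree node; leaves carry a prediction, internal
    nodes carry a boolean test (here a feature/threshold pair)."""
    feature: Optional[int] = None
    threshold: Optional[float] = None
    left: Optional["Node"] = None       # branch taken when the test holds
    right: Optional["Node"] = None      # branch taken otherwise
    prediction: Optional[float] = None  # set only on leaves

Rule = Tuple[List[str], float]  # (conjunction of conditions, prediction)

def to_decision_list(node: Node, conds: Optional[List[str]] = None) -> List[Rule]:
    """Enumerate root-to-leaf paths as an ordered list of if-then rules.
    Each rule body is the conjunction of tests along one path."""
    conds = conds or []
    if node.prediction is not None:  # leaf: emit one rule for this path
        return [(conds, node.prediction)]
    test = f"x[{node.feature}] <= {node.threshold}"
    return (to_decision_list(node.left, conds + [test]) +
            to_decision_list(node.right, conds + [f"not ({test})"]))

# Toy tree purely for demonstration.
tree = Node(feature=0, threshold=2.0,
            left=Node(feature=1, threshold=5.0,
                      left=Node(prediction=1.0),
                      right=Node(prediction=0.5)),
            right=Node(prediction=0.0))

for body, pred in to_decision_list(tree):
    print("IF", " AND ".join(body) or "TRUE", "THEN", pred)
```

The sketch covers only the tree-to-list conversion; CoTE's contribution lies in the subsequent step, where the per-tree lists are combined and pruned into a single small decision list with the aid of the original training set.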

Cite

Text

Yan et al. "Explainable Models via Compression of Tree Ensembles." Machine Learning, 2024. doi:10.1007/s10994-023-06463-1

Markdown

[Yan et al. "Explainable Models via Compression of Tree Ensembles." Machine Learning, 2024.](https://mlanthology.org/mlj/2024/yan2024mlj-explainable/) doi:10.1007/s10994-023-06463-1

BibTeX

@article{yan2024mlj-explainable,
  title     = {{Explainable Models via Compression of Tree Ensembles}},
  author    = {Yan, Siwen and Natarajan, Sriraam and Joshi, Saket and Khardon, Roni and Tadepalli, Prasad},
  journal   = {Machine Learning},
  year      = {2024},
  volume    = {113},
  pages     = {1303--1328},
  doi       = {10.1007/s10994-023-06463-1},
  url       = {https://mlanthology.org/mlj/2024/yan2024mlj-explainable/}
}