Decision Tree Pruning as a Search in the State Space

Abstract

This paper presents a study of one particular problem of decision tree induction, namely (post-)pruning, with the aim of finding a common framework for the plethora of pruning methods appeared in literature. Given a tree T_max to prune, a state space is defined as the set of all subtrees of T_max to which only one operator, called any-depth branch pruning operator, can be applied in several ways in order to move from one state to another. By introducing an evaluation function f defined on the set of subtrees, the problem of tree pruning can be cast as an optimization problem, and it is also possible to classify each post-pruning method according to both its search strategy and the kind of information exploited by f . Indeed, while some methods use only the training set in order to evaluate the accuracy of a decision tree, other methods exploit an additional pruning set that allows them to get less biased estimates of the predictive accuracy of apruned tree. The introduction of the state space shows that very simple search strategies are used by the postpruning methods considered. Finally, some empirical results allow theoretical observations on strengths and weaknesses of pruning methods to be better understood.

Cite

Text

Esposito et al. "Decision Tree Pruning as a Search in the State Space." European Conference on Machine Learning, 1993. doi:10.1007/3-540-56602-3_135

Markdown

[Esposito et al. "Decision Tree Pruning as a Search in the State Space." European Conference on Machine Learning, 1993.](https://mlanthology.org/ecmlpkdd/1993/esposito1993ecml-decision/) doi:10.1007/3-540-56602-3_135

BibTeX

@inproceedings{esposito1993ecml-decision,
  title     = {{Decision Tree Pruning as a Search in the State Space}},
  author    = {Esposito, Floriana and Malerba, Donato and Semeraro, Giovanni},
  booktitle = {European Conference on Machine Learning},
  year      = {1993},
  pages     = {165-184},
  doi       = {10.1007/3-540-56602-3_135},
  url       = {https://mlanthology.org/ecmlpkdd/1993/esposito1993ecml-decision/}
}