A K-norm Pruning Algorithm for Decision Tree Classifiers Based on Error Rate Estimation

Zhong, Mingyu; Georgiopoulos, Michael; Anagnostopoulos, Georgios C.

doi:10.1007/S10994-007-5044-4

Machine Learning (MLJ), vol. 71, 2008, pp. 55-88

Abstract

Decision trees are well-known and established models for classification and regression. In this paper, we focus on estimating and minimizing the misclassification rate of decision tree classifiers. We apply Lidstone’s Law of Succession to estimate the class probabilities and error rates. We take into account not only the expected value of the error rate, which has been the norm in existing research, but also its reliability (measured by the standard deviation). Based on this estimation, we propose an efficient pruning algorithm, called k-norm pruning, that has a clear theoretical interpretation, is easily implemented, and does not require a validation set. Our experiments show that the proposed pruning algorithm produces accurate trees quickly and compares very favorably with two other well-known pruning algorithms, CCP of CART and EBP of C4.5.
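As a rough illustration of the estimation step the abstract mentions, Lidstone's Law of Succession smooths the class counts at a tree node as (n_j + λ) / (N + λJ), where n_j is the count of class j, N the total count, and J the number of classes. The sketch below is a minimal example under stated assumptions: the function names, the default λ = 0.5, and the use of one minus the largest smoothed probability as a leaf error estimate are illustrative choices, not taken from the paper, and the standard-deviation and k-norm parts of the method are not reproduced here.

```python
from typing import List, Sequence


def lidstone_probabilities(counts: Sequence[int], lam: float = 0.5) -> List[float]:
    """Lidstone-smoothed class probabilities: (n_j + lam) / (N + lam * J).

    `lam` is the smoothing constant (hypothetical default; lam = 1 gives
    Laplace's rule of succession).
    """
    if lam <= 0:
        raise ValueError("lam must be positive")
    if not counts:
        raise ValueError("counts must contain at least one class")
    total = sum(counts)                      # N: examples reaching the node
    denom = total + lam * len(counts)        # N + lam * J
    return [(n + lam) / denom for n in counts]


def leaf_error_estimate(counts: Sequence[int], lam: float = 0.5) -> float:
    """Illustrative point estimate of a leaf's error rate.

    Assumes the leaf predicts its majority class, so the estimated error
    is one minus the largest smoothed class probability.
    """
    return 1.0 - max(lidstone_probabilities(counts, lam))


if __name__ == "__main__":
    # A leaf holding 8 examples of class 0 and 2 of class 1, with lam = 1.
    print(lidstone_probabilities([8, 2], lam=1.0))
    print(leaf_error_estimate([8, 2], lam=1.0))
```

Note that an empty leaf still receives a well-defined estimate: with no observations every class gets probability 1/J, which is the practical appeal of smoothing the raw frequencies.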

Cite

Text

Zhong et al. "A K-norm Pruning Algorithm for Decision Tree Classifiers Based on Error Rate Estimation." Machine Learning, 2008. doi:10.1007/S10994-007-5044-4

Markdown

[Zhong et al. "A K-norm Pruning Algorithm for Decision Tree Classifiers Based on Error Rate Estimation." Machine Learning, 2008.](https://mlanthology.org/mlj/2008/zhong2008mlj-norm/) doi:10.1007/S10994-007-5044-4

BibTeX

@article{zhong2008mlj-norm,
  title     = {{A K-norm Pruning Algorithm for Decision Tree Classifiers Based on Error Rate Estimation}},
  author    = {Zhong, Mingyu and Georgiopoulos, Michael and Anagnostopoulos, Georgios C.},
  journal   = {Machine Learning},
  year      = {2008},
  pages     = {55--88},
  doi       = {10.1007/S10994-007-5044-4},
  volume    = {71},
  url       = {https://mlanthology.org/mlj/2008/zhong2008mlj-norm/}
}