Investigation and Reduction of Discretization Variance in Decision Tree Induction

Geurts, Pierre; Wehenkel, Louis

doi:10.1007/3-540-45164-1_17

ECML-PKDD 2000, pp. 162-170

Abstract

This paper focuses on the variance introduced by the discretization techniques used to handle continuous attributes in decision tree induction. Different discretization procedures are first studied empirically, then means to reduce the discretization variance are proposed. The experiments show that discretization variance is large and that it can be reduced significantly without notable computational cost. The resulting variance reduction mainly improves the interpretability and stability of decision trees, and only marginally their accuracy.

Cite

Text

Geurts and Wehenkel. "Investigation and Reduction of Discretization Variance in Decision Tree Induction." European Conference on Machine Learning, 2000. doi:10.1007/3-540-45164-1_17

Markdown

[Geurts and Wehenkel. "Investigation and Reduction of Discretization Variance in Decision Tree Induction." European Conference on Machine Learning, 2000.](https://mlanthology.org/ecmlpkdd/2000/geurts2000ecml-investigation/) doi:10.1007/3-540-45164-1_17

BibTeX

@inproceedings{geurts2000ecml-investigation,
  title     = {{Investigation and Reduction of Discretization Variance in Decision Tree Induction}},
  author    = {Geurts, Pierre and Wehenkel, Louis},
  booktitle = {European Conference on Machine Learning},
  year      = {2000},
  pages     = {162--170},
  doi       = {10.1007/3-540-45164-1_17},
  url       = {https://mlanthology.org/ecmlpkdd/2000/geurts2000ecml-investigation/}
}