Exploiting the Cost (In)sensitivity of Decision Tree Splitting Criteria

Abstract

This paper investigates how the splitting criteria and pruning methods of decision tree learning algorithms are influenced by misclassification costs or changes to the class distribution. Splitting criteria that are relatively insensitive to costs (class distributions) are found to perform as well as or better than, in terms of expected misclassification cost, splitting criteria that are cost sensitive. Consequently there are two opposite ways of dealing with imbalance. One is to combine a costinsensitive splitting criterion with a cost insensitive pruning method to produce a decision tree algorithm little affected by cost or prior class distribution. The other is to grow a cost-independent tree which is then pruned in a cost-sensitive manner. 1. Introduction When applying machine learning to real world classification problems two complications that often arise are imbalanced classes (one class occurs much more often than the other (Kubat et al., 1998; Ezawa et al., 1...

Cite

Text

Drummond and Holte. "Exploiting the Cost (In)sensitivity of Decision Tree Splitting Criteria." International Conference on Machine Learning, 2000.

Markdown

[Drummond and Holte. "Exploiting the Cost (In)sensitivity of Decision Tree Splitting Criteria." International Conference on Machine Learning, 2000.](https://mlanthology.org/icml/2000/drummond2000icml-exploiting/)

BibTeX

@inproceedings{drummond2000icml-exploiting,
  title     = {{Exploiting the Cost (In)sensitivity of Decision Tree Splitting Criteria}},
  author    = {Drummond, Chris and Holte, Robert C.},
  booktitle = {International Conference on Machine Learning},
  year      = {2000},
  pages     = {239-246},
  url       = {https://mlanthology.org/icml/2000/drummond2000icml-exploiting/}
}