Coding Decision Trees

Abstract

Quinlan and Rivest have suggested a decision-tree inference method using the Minimum Description Length idea. We show that there is an error in their derivation of message lengths, which fortunately has no effect on the final inference. We further suggest two improvements to their coding techniques, one removing an inefficiency in the description of non-binary trees, and one improving the coding of leaves. We argue that these improvements are superior to similarly motivated proposals in the original paper. Empirical tests confirm the good results reported by Quinlan and Rivest, and show our coding proposals to lead to useful improvements in the performance of the method.

Cite

Text

Wallace and Patrick. "Coding Decision Trees." Machine Learning, 1993. doi:10.1023/A:1022646101185

Markdown

[Wallace and Patrick. "Coding Decision Trees." Machine Learning, 1993.](https://mlanthology.org/mlj/1993/wallace1993mlj-coding/) doi:10.1023/A:1022646101185

BibTeX

@article{wallace1993mlj-coding,
  title     = {{Coding Decision Trees}},
  author    = {Wallace, Chris S. and Patrick, Jon D.},
  journal   = {Machine Learning},
  year      = {1993},
  pages     = {7-22},
  doi       = {10.1023/A:1022646101185},
  volume    = {11},
  url       = {https://mlanthology.org/mlj/1993/wallace1993mlj-coding/}
}