Why Discretization Works for Naive Bayesian Classifiers
Abstract
This paper explains why well-known discretization methods, such as entropy-based and ten-bin, work well for naive Bayesian classifiers with continuous variables, regardless of their complexities. These methods usually assume that discretized variables have Dirichlet priors. Since perfect aggregation holds for Dirichlets, we can show that, generally, a wide variety of discretization methods can perform well with insignificant difference. We identify situations where discretization may cause performance degradation and show that they are unlikely to happen for well-known methods. We empirically test our explanation with synthesized and real data sets and obtain confirming results. Our analysis leads to a lazy discretization method that can simplify the training for naive Bayes. This new method can perform as well as well-known methods in our experiment. 1. Introduction Learning a naive Bayesian classifier (a.k.a. naive Bayes) (Langley et al., 1992) from data is an ...
Cite
Text
Hsu et al. "Why Discretization Works for Naive Bayesian Classifiers." International Conference on Machine Learning, 2000.Markdown
[Hsu et al. "Why Discretization Works for Naive Bayesian Classifiers." International Conference on Machine Learning, 2000.](https://mlanthology.org/icml/2000/hsu2000icml-discretization/)BibTeX
@inproceedings{hsu2000icml-discretization,
title = {{Why Discretization Works for Naive Bayesian Classifiers}},
author = {Hsu, Chun-Nan and Huang, Hung-Ju and Wong, Tzu-Tsung},
booktitle = {International Conference on Machine Learning},
year = {2000},
pages = {399-406},
url = {https://mlanthology.org/icml/2000/hsu2000icml-discretization/}
}