Tackling the Poor Assumptions of Naive Bayes Text Classifiers

Rennie, Jason D. M.; Shih, Lawrence; Teevan, Jaime; Karger, David R.

Tackling the Poor Assumptions of Naive Bayes Text Classifiers

Jason D. M. Rennie, Lawrence Shih, Jaime Teevan, David R. Karger

ICML 2003 pp. 616-623

/icml/2003/rennie2003icml-tackling/

Abstract

Naive Bayes is often used as a baseline text classiffication because it is fast and easy to implement. Its severe assumptions make such efficiency possible but also adversely affect the quality of its results. In this paper we propose simple, heuristic solutions to some the problems with Naive Bayes classifiers, addressing both systemic issues as well as problems that arise because text is not actually generated according to a multinomial model. We find that our simple corrections result in fast algorithm that is competitive with state-of-the-art text classification algorithms such as the Support Vector Machine. ICML Proceedings of the Twentieth International Conference on Machine Learning

PDF ICML Semantic Scholar

Cite

Text

Rennie et al. "Tackling the Poor Assumptions of Naive Bayes Text Classifiers." International Conference on Machine Learning, 2003.

Markdown

[Rennie et al. "Tackling the Poor Assumptions of Naive Bayes Text Classifiers." International Conference on Machine Learning, 2003.](https://mlanthology.org/icml/2003/rennie2003icml-tackling/)

BibTeX

@inproceedings{rennie2003icml-tackling,
  title     = {{Tackling the Poor Assumptions of Naive Bayes Text Classifiers}},
  author    = {Rennie, Jason D. M. and Shih, Lawrence and Teevan, Jaime and Karger, David R.},
  booktitle = {International Conference on Machine Learning},
  year      = {2003},
  pages     = {616-623},
  url       = {https://mlanthology.org/icml/2003/rennie2003icml-tackling/}
}