A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection

Abstract

We review accuracy estimation methods and compare the two most common methods: crossvalidation and bootstrap. Recent experimental results on artificial data and theoretical results in restricted settings have shown that for selecting a good classifier from a set of classifiers (model selection), ten-fold cross-validation may be better than the more expensiveleaveone -out cross-validation. We report on a largescale experiment---over half a million runs of C4.5 and a Naive-Bayes algorithm---to estimate the effects of different parameters on these algorithms on real-world datasets. For crossvalidation, wevary the number of folds and whether the folds are stratified or not# for bootstrap, wevary the number of bootstrap samples. Our results indicate that for real-word datasets similar to ours, the best method to use for model selection is ten-fold stratified cross validation, even if computation power allows using more folds.

Cite

Text

Kohavi. "A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection." International Joint Conference on Artificial Intelligence, 1995.

Markdown

[Kohavi. "A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection." International Joint Conference on Artificial Intelligence, 1995.](https://mlanthology.org/ijcai/1995/kohavi1995ijcai-study/)

BibTeX

@inproceedings{kohavi1995ijcai-study,
  title     = {{A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection}},
  author    = {Kohavi, Ron},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {1995},
  pages     = {1137-1145},
  url       = {https://mlanthology.org/ijcai/1995/kohavi1995ijcai-study/}
}