An Extension on “Statistical Comparisons of Classifiers over Multiple Data Sets” for All Pairwise Comparisons

Abstract

In a paper recently published in JMLR, Demšar (2006) recommends a set of non-parametric statistical tests and procedures that can be safely used to compare the performance of classifiers over multiple data sets. After studying the paper, we find that it correctly introduces the basic procedures and some of the most advanced ones for comparisons against a control method. However, it does not treat some advanced topics in depth. Among these topics, we focus on more powerful statistical procedures for n × n comparisons, that is, all pairwise comparisons among classifiers. Moreover, we illustrate an easy way of obtaining adjusted and comparable p-values in multiple comparison procedures.

Cite

Text

García and Herrera. "An Extension on “Statistical Comparisons of Classifiers over Multiple Data Sets” for All Pairwise Comparisons." Journal of Machine Learning Research, 2008.

Markdown

[García and Herrera. "An Extension on “Statistical Comparisons of Classifiers over Multiple Data Sets” for All Pairwise Comparisons." Journal of Machine Learning Research, 2008.](https://mlanthology.org/jmlr/2008/garcia2008jmlr-extension/)

BibTeX

@article{garcia2008jmlr-extension,
  title     = {{An Extension on “Statistical Comparisons of Classifiers over Multiple Data Sets” for All Pairwise Comparisons}},
  author    = {García, Salvador and Herrera, Francisco},
  journal   = {Journal of Machine Learning Research},
  year      = {2008},
  pages     = {2677--2694},
  volume    = {9},
  url       = {https://mlanthology.org/jmlr/2008/garcia2008jmlr-extension/}
}