Learning and Evaluating Classifiers Under Sample Selection Bias

Abstract

Classifier learning methods commonly assume that the training data consist of randomly drawn examples from the same distribution as the test examples about which the learned model is expected to make predictions. In many practical situations, however, this assumption is violated, in a problem known in econometrics as sample selection bias. In this paper, we formalize the sample selection bias problem in machine learning terms and study analytically and experimentally how a number of well-known classifier learning methods are affected by it. We also present a bias correction method that is particularly useful for classifier evaluation under sample selection bias.

Cite

Text

Zadrozny. "Learning and Evaluating Classifiers Under Sample Selection Bias." International Conference on Machine Learning, 2004. doi:10.1145/1015330.1015425

Markdown

[Zadrozny. "Learning and Evaluating Classifiers Under Sample Selection Bias." International Conference on Machine Learning, 2004.](https://mlanthology.org/icml/2004/zadrozny2004icml-learning/) doi:10.1145/1015330.1015425

BibTeX

@inproceedings{zadrozny2004icml-learning,
  title     = {{Learning and Evaluating Classifiers Under Sample Selection Bias}},
  author    = {Zadrozny, Bianca},
  booktitle = {International Conference on Machine Learning},
  year      = {2004},
  doi       = {10.1145/1015330.1015425},
  url       = {https://mlanthology.org/icml/2004/zadrozny2004icml-learning/}
}