Cost Sensitive Evaluation of Instance Hardness in Machine Learning

Abstract

Measuring hardness of individual instances in machine learning contributes to a deeper analysis of learning performance. This work proposes instance hardness measures for binary classification in cost-sensitive scenarios. Here cost curves are generated for each instance, defined as the loss observed for a pool of learning models for that instance along the range of cost proportions. Instance hardness is defined as the area under the cost curves and can be seen as an expected loss of difficulty along cost proportions. Different cost curves were proposed by considering common decision threshold choice methods in literature, thus providing alternative views of instance hardness.

Cite

Text

Prudêncio. "Cost Sensitive Evaluation of Instance Hardness in Machine Learning." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2019. doi:10.1007/978-3-030-46147-8_6

Markdown

[Prudêncio. "Cost Sensitive Evaluation of Instance Hardness in Machine Learning." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2019.](https://mlanthology.org/ecmlpkdd/2019/prudencio2019ecmlpkdd-cost/) doi:10.1007/978-3-030-46147-8_6

BibTeX

@inproceedings{prudencio2019ecmlpkdd-cost,
  title     = {{Cost Sensitive Evaluation of Instance Hardness in Machine Learning}},
  author    = {Prudêncio, Ricardo B. C.},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2019},
  pages     = {86-102},
  doi       = {10.1007/978-3-030-46147-8_6},
  url       = {https://mlanthology.org/ecmlpkdd/2019/prudencio2019ecmlpkdd-cost/}
}