A Blended Metric for Multi-Label Optimisation and Evaluation
Abstract
In multi-label classification, a large number of evaluation metrics exist, for example Hamming loss, exact match, and Jaccard similarity – but there are many more. In fact, there remains an apparent uncertainty in the multi-label literature about which metrics should be considered and when and how to optimise them. This has given rise to a proliferation of metrics, with some papers carrying out empirical evaluations under 10 or more different metrics in order to analyse method performance. We argue that further understanding of underlying mechanisms is necessary. In this paper we tackle the challenge of having a clearer view of evaluation strategies. We present a blended loss function. This function allows us to evaluate under the properties of several major loss functions with a single parameterisation. Furthermore we demonstrate the successful use of this metric as a surrogate loss for other metrics. We offer experimental investigation and theoretical backing to demonstrate that optimising this surrogate loss offers best results for several different metrics than optimising the metrics directly. It simplifies and provides insight to the task of evaluating multi-label prediction methodologies. Data related to this paper are available at: http://mulan.sourceforge.net/datasets-mlc.html , https://sourceforge.net/projects/meka/files/Datasets/ , http://www.ces.clemson.edu/~ahoover/stare/ .
Cite
Text
Park and Read. "A Blended Metric for Multi-Label Optimisation and Evaluation." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2018. doi:10.1007/978-3-030-10925-7_44Markdown
[Park and Read. "A Blended Metric for Multi-Label Optimisation and Evaluation." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2018.](https://mlanthology.org/ecmlpkdd/2018/park2018ecmlpkdd-blended/) doi:10.1007/978-3-030-10925-7_44BibTeX
@inproceedings{park2018ecmlpkdd-blended,
title = {{A Blended Metric for Multi-Label Optimisation and Evaluation}},
author = {Park, Laurence A. F. and Read, Jesse},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2018},
pages = {719-734},
doi = {10.1007/978-3-030-10925-7_44},
url = {https://mlanthology.org/ecmlpkdd/2018/park2018ecmlpkdd-blended/}
}