Classification of Multi-Labeled Data: A Generative Approach

Abstract

Multi-label classification assigns a data item to one or several classes. This problem of multiple labels arises in fields such as acoustic and visual scene analysis, news reports and medical diagnosis. In a generative framework, data with multiple labels can be interpreted as additive mixtures of the emissions of the individual sources. We propose a deconvolution approach to estimate the individual contribution of each source to a given data item. Similarly, the distributions of multi-label data are computed from the source distributions. In experiments with synthetic data, the novel approach is compared to existing models and yields more accurate parameter estimates, higher classification accuracy and improved generalization to previously unseen label sets. These improvements are most pronounced on small training data sets. On real-world acoustic data, the algorithm also outperforms other generative models, in particular on small training data sets.
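
The additive-mixture idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes one-dimensional, independent Gaussian sources with known parameters, and the names `SOURCES`, `label_set_params`, `expected_contributions` and `classify` are hypothetical. Under that assumption, the sum of the emissions of a label set is again Gaussian, with summed means and summed variances, so the distribution of multi-label data follows directly from the source distributions.

```python
import itertools
import math

# Hypothetical 1-D Gaussian sources (assumption): label -> (mean, variance).
SOURCES = {"a": (1.0, 0.25), "b": (4.0, 0.25), "c": (-3.0, 0.5)}


def label_set_params(labels):
    """Mean and variance of the sum of independent Gaussian emissions."""
    mean = sum(SOURCES[k][0] for k in labels)
    var = sum(SOURCES[k][1] for k in labels)
    return mean, var


def log_likelihood(x, labels):
    """Gaussian log-density of observation x under the given label set."""
    mean, var = label_set_params(labels)
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)


def expected_contributions(x, labels):
    """Posterior mean of each source emission given that they sum to x.

    For independent Gaussians, the residual (x - total mean) is shared
    among the sources in proportion to their variances.
    """
    mean, var = label_set_params(labels)
    return {
        k: SOURCES[k][0] + SOURCES[k][1] / var * (x - mean) for k in labels
    }


def classify(x, max_size=2):
    """Pick the label set (up to max_size labels) with the highest likelihood.

    A uniform prior over candidate label sets is assumed here.
    """
    names = sorted(SOURCES)
    candidates = [
        combo
        for size in range(1, max_size + 1)
        for combo in itertools.combinations(names, size)
    ]
    return max(candidates, key=lambda labels: log_likelihood(x, labels))
```

For instance, `classify(5.0)` selects the label set `("a", "b")`, because the summed means of those two sources match the observation, and `expected_contributions(5.5, ("a", "b"))` splits the observation into one estimated contribution per source. In practice the source parameters would have to be estimated from training data, which this sketch does not attempt.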

Cite

Text

Streich and Buhmann. "Classification of Multi-Labeled Data: A Generative Approach." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2008. doi:10.1007/978-3-540-87481-2_26

Markdown

[Streich and Buhmann. "Classification of Multi-Labeled Data: A Generative Approach." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2008.](https://mlanthology.org/ecmlpkdd/2008/streich2008ecmlpkdd-classification/) doi:10.1007/978-3-540-87481-2_26

BibTeX

@inproceedings{streich2008ecmlpkdd-classification,
  title     = {{Classification of Multi-Labeled Data: A Generative Approach}},
  author    = {Streich, Andreas P. and Buhmann, Joachim M.},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2008},
  pages     = {390--405},
  doi       = {10.1007/978-3-540-87481-2_26},
  url       = {https://mlanthology.org/ecmlpkdd/2008/streich2008ecmlpkdd-classification/}
}