Reducing Label Complexity by Learning from Bags
Abstract
We consider a supervised learning setting in which the main cost of learning is the number of training labels and one can obtain a single label for a bag of examples, indicating only if a positive example exists in the bag, as in Multi-Instance Learning. We thus propose to create a training sample of bags, and to use the obtained labels to learn to classify individual examples. We provide a theoretical analysis showing how to select the bag size as a function of the problem parameters, and prove that if the original labels are distributed unevenly, the number of required labels drops considerably when learning from bags. We demonstrate that finding a low-error separating hyperplane from bags is feasible in this setting using a simple iterative procedure similar to latent SVM. Experiments on synthetic and real data sets demonstrate the success of the approach.
Cite
Text
Sabato et al. "Reducing Label Complexity by Learning from Bags." Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010.Markdown
[Sabato et al. "Reducing Label Complexity by Learning from Bags." Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010.](https://mlanthology.org/aistats/2010/sabato2010aistats-reducing/)BibTeX
@inproceedings{sabato2010aistats-reducing,
title = {{Reducing Label Complexity by Learning from Bags}},
author = {Sabato, Sivan and Srebro, Nathan and Tishby, Naftali},
booktitle = {Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics},
year = {2010},
pages = {685-692},
volume = {9},
url = {https://mlanthology.org/aistats/2010/sabato2010aistats-reducing/}
}