Estimating Labels from Label Proportions

Abstract

Consider the following problem: given sets of unlabeled observations, each set with known label proportions, predict the labels of another set of observations, also with known label proportions. This problem appears in areas like e-commerce, spam filtering and improper content detection. We present consistent estimators which can reconstruct the correct labels with high probability in a uniform convergence sense. Experiments show that our method works well in practice.

Cite

Text

Quadrianto et al. "Estimating Labels from Label Proportions." International Conference on Machine Learning, 2008. doi:10.1145/1390156.1390254

Markdown

[Quadrianto et al. "Estimating Labels from Label Proportions." International Conference on Machine Learning, 2008.](https://mlanthology.org/icml/2008/quadrianto2008icml-estimating/) doi:10.1145/1390156.1390254

BibTeX

@inproceedings{quadrianto2008icml-estimating,
  title     = {{Estimating Labels from Label Proportions}},
  author    = {Quadrianto, Novi and Smola, Alexander J. and Caetano, Tibério S. and Le, Quoc V.},
  booktitle = {International Conference on Machine Learning},
  year      = {2008},
  pages     = {776-783},
  doi       = {10.1145/1390156.1390254},
  url       = {https://mlanthology.org/icml/2008/quadrianto2008icml-estimating/}
}