Learning from Crowds via Joint Probabilistic Matrix Factorization and Clustering in Latent Space
Abstract
Learning from noisy labels is getting trendy in the era of big data. However, in crowdsourcing practice, it is still a challenging task to extract ground truth labels from noisy labels obtained from crowds. In this paper, we propose a latent variable model built on probabilistic logistic matrix factorization model and classical Gaussian mixture model for inferring ground truth labels from noisy, crowdsourced ones. The proposed model incorporates item heterogeneity in contrast to previous works and allows for vector space embeddings of both items and worker labels. Moreover, we derive a tractable mean-field variational inference algorithm to approximate the model posterior. Meanwhile, related MAP approximation problem to the model posterior is also investigated to identify links to existing works. Empirically, we demonstrate that the proposed method achieves good inference accuracy while preserving meaningful uncertainty measures in the embeddings, and therefore better reflects the intrinsic structure of data.
Cite
Text
Yao et al. "Learning from Crowds via Joint Probabilistic Matrix Factorization and Clustering in Latent Space." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2020. doi:10.1007/978-3-030-67667-4_33Markdown
[Yao et al. "Learning from Crowds via Joint Probabilistic Matrix Factorization and Clustering in Latent Space." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2020.](https://mlanthology.org/ecmlpkdd/2020/yao2020ecmlpkdd-learning/) doi:10.1007/978-3-030-67667-4_33BibTeX
@inproceedings{yao2020ecmlpkdd-learning,
title = {{Learning from Crowds via Joint Probabilistic Matrix Factorization and Clustering in Latent Space}},
author = {Yao, Wuguannan and Lee, Wonjung and Wang, Junhui},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2020},
pages = {546-561},
doi = {10.1007/978-3-030-67667-4_33},
url = {https://mlanthology.org/ecmlpkdd/2020/yao2020ecmlpkdd-learning/}
}