Aggregating Crowd Wisdom with Side Information via a Clustering-Based Label-Aware Autoencoder

Abstract

Aggregating crowd wisdom infers true labels for objects, from multiple noisy labels provided by various sources. Besides labels from sources, side information such as object features is also introduced to achieve higher inference accuracy. Usually, the learning-from-crowds framework is adopted. However, the framework considers each object in isolation and does not make full use of object features to overcome label noise. In this paper, we propose a clustering-based label-aware autoencoder (CLA) to alleviate label noise. CLA utilizes clusters to gather objects with similar features and exploits clustering to infer true labels, by constructing a novel deep generative process to simultaneously generate object features and source labels from clusters. For model inference, CLA extends the framework of variational autoencoders and utilizes maximizing a posteriori (MAP) estimation, which prevents the model from overfitting and trivial solutions. Experiments on real-world tasks demonstrate the significant improvement of CLA compared with the state-of-the-art aggregation algorithms.

Cite

Text

Yin et al. "Aggregating Crowd Wisdom with Side Information via a Clustering-Based Label-Aware Autoencoder." International Joint Conference on Artificial Intelligence, 2020. doi:10.24963/IJCAI.2020/214

Markdown

[Yin et al. "Aggregating Crowd Wisdom with Side Information via a Clustering-Based Label-Aware Autoencoder." International Joint Conference on Artificial Intelligence, 2020.](https://mlanthology.org/ijcai/2020/yin2020ijcai-aggregating/) doi:10.24963/IJCAI.2020/214

BibTeX

@inproceedings{yin2020ijcai-aggregating,
  title     = {{Aggregating Crowd Wisdom with Side Information via a Clustering-Based Label-Aware Autoencoder}},
  author    = {Yin, Li'ang and Liu, Yunfei and Zhang, Weinan and Yu, Yong},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {1542-1548},
  doi       = {10.24963/IJCAI.2020/214},
  url       = {https://mlanthology.org/ijcai/2020/yin2020ijcai-aggregating/}
}