Quality Control Attack Schemes in Crowdsourcing

Alessandro Checco, Jo Bates, Gianluca Demartini

IJCAI 2019 pp. 6136-6140

doi:10.24963/IJCAI.2019/850 /ijcai/2019/checco2019ijcai-quality/

Abstract

An important precondition to build effective AI models is the collection of training data at scale. Crowdsourcing is a popular methodology to achieve this goal. Its adoption introduces novel challenges in data quality control, to deal with under-performing and malicious annotators. One of the most popular quality assurance mechanisms, especially in paid micro-task crowdsourcing, is the use of a small set of pre-annotated tasks as gold standard, to assess in real time the annotators quality. In this paper, we highlight a set of vulnerabilities this scheme suffers: a group of colluding crowd workers can easily implement and deploy a decentralised machine learning inferential system to detect and signal which parts of the task are more likely to be gold questions, making them ineffective as a quality control tool. Moreover, we demonstrate how the most common countermeasures against this attack are ineffective in practical scenarios. The basic architecture of the inferential system is composed of a browser plug-in and an external server where the colluding workers can share information. We implement and validate the attack scheme, by means of experiments on real-world data from a popular crowdsourcing platform.

PDF IJCAI Semantic Scholar

Cite

Text

Checco et al. "Quality Control Attack Schemes in Crowdsourcing." International Joint Conference on Artificial Intelligence, 2019. doi:10.24963/IJCAI.2019/850

Markdown

[Checco et al. "Quality Control Attack Schemes in Crowdsourcing." International Joint Conference on Artificial Intelligence, 2019.](https://mlanthology.org/ijcai/2019/checco2019ijcai-quality/) doi:10.24963/IJCAI.2019/850

BibTeX

@inproceedings{checco2019ijcai-quality,
  title     = {{Quality Control Attack Schemes in Crowdsourcing}},
  author    = {Checco, Alessandro and Bates, Jo and Demartini, Gianluca},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2019},
  pages     = {6136-6140},
  doi       = {10.24963/IJCAI.2019/850},
  url       = {https://mlanthology.org/ijcai/2019/checco2019ijcai-quality/}
}