Quality Control Attack Schemes in Crowdsourcing
Abstract
An important precondition to build effective AI models is the collection of training data at scale. Crowdsourcing is a popular methodology to achieve this goal. Its adoption introduces novel challenges in data quality control, to deal with under-performing and malicious annotators. One of the most popular quality assurance mechanisms, especially in paid micro-task crowdsourcing, is the use of a small set of pre-annotated tasks as gold standard, to assess in real time the annotators quality. In this paper, we highlight a set of vulnerabilities this scheme suffers: a group of colluding crowd workers can easily implement and deploy a decentralised machine learning inferential system to detect and signal which parts of the task are more likely to be gold questions, making them ineffective as a quality control tool. Moreover, we demonstrate how the most common countermeasures against this attack are ineffective in practical scenarios. The basic architecture of the inferential system is composed of a browser plug-in and an external server where the colluding workers can share information. We implement and validate the attack scheme, by means of experiments on real-world data from a popular crowdsourcing platform.
Cite
Text
Checco et al. "Quality Control Attack Schemes in Crowdsourcing." International Joint Conference on Artificial Intelligence, 2019. doi:10.24963/IJCAI.2019/850Markdown
[Checco et al. "Quality Control Attack Schemes in Crowdsourcing." International Joint Conference on Artificial Intelligence, 2019.](https://mlanthology.org/ijcai/2019/checco2019ijcai-quality/) doi:10.24963/IJCAI.2019/850BibTeX
@inproceedings{checco2019ijcai-quality,
title = {{Quality Control Attack Schemes in Crowdsourcing}},
author = {Checco, Alessandro and Bates, Jo and Demartini, Gianluca},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2019},
pages = {6136-6140},
doi = {10.24963/IJCAI.2019/850},
url = {https://mlanthology.org/ijcai/2019/checco2019ijcai-quality/}
}