Fundamental Limits of Budget-Fidelity Trade-Off in Label Crowdsourcing

Abstract

Digital crowdsourcing (CS) is a modern approach to perform certain large projects using small contributions of a large crowd. In CS, a taskmaster typically breaks down the project into small batches of tasks and assigns them to so-called workers with imperfect skill levels. The crowdsourcer then collects and analyzes the results for inference and serving the purpose of the project. In this work, the CS problem, as a human-in-the-loop computation problem, is modeled and analyzed in an information theoretic rate-distortion framework. The purpose is to identify the ultimate fidelity that one can achieve by any form of query from the crowd and any decoding (inference) algorithm with a given budget. The results are established by a joint source channel (de)coding scheme, which represent the query scheme and inference, over parallel noisy channels, which model workers with imperfect skill levels. We also present and analyze a query scheme dubbed k-ary incidence coding and study optimized query pricing in this setting.

Cite

Text

Lahouti and Hassibi. "Fundamental Limits of Budget-Fidelity Trade-Off in Label Crowdsourcing." Neural Information Processing Systems, 2016.

Markdown

[Lahouti and Hassibi. "Fundamental Limits of Budget-Fidelity Trade-Off in Label Crowdsourcing." Neural Information Processing Systems, 2016.](https://mlanthology.org/neurips/2016/lahouti2016neurips-fundamental/)

BibTeX

@inproceedings{lahouti2016neurips-fundamental,
  title     = {{Fundamental Limits of Budget-Fidelity Trade-Off in Label Crowdsourcing}},
  author    = {Lahouti, Farshad and Hassibi, Babak},
  booktitle = {Neural Information Processing Systems},
  year      = {2016},
  pages     = {5058-5066},
  url       = {https://mlanthology.org/neurips/2016/lahouti2016neurips-fundamental/}
}