Tagging like Humans: Diverse and Distinct Image Annotation

Wu, Baoyuan; Chen, Weidong; Sun, Peng; Liu, Wei; Ghanem, Bernard; Lyu, Siwei

doi:10.1109/CVPR.2018.00831

CVPR 2018

Abstract

In this work, we propose a new automatic image annotation model, dubbed diverse and distinct image annotation (D2IA). The generative model D2IA is inspired by the ensemble of human annotations, which creates semantically relevant yet distinct and diverse tags. In D2IA, we generate a relevant and distinct tag subset, in which the tags are relevant to the image contents and semantically distinct from each other, using sequential sampling from a determinantal point process (DPP) model. Multiple such tag subsets that cover diverse semantic aspects or diverse semantic levels of the image contents are generated by randomly perturbing the DPP sampling process. We leverage a generative adversarial network (GAN) model to train D2IA. We perform extensive experiments, including quantitative and qualitative comparisons as well as human subject studies, on two benchmark datasets to demonstrate that the proposed model can produce more diverse and distinct tags than state-of-the-art methods.
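
To make the idea of sequentially sampling a relevant and distinct tag subset concrete, here is a minimal sketch in Python. It is not the paper's algorithm. It assumes a standard quality-times-similarity DPP kernel and picks each next tag with probability proportional to the determinant of the enlarged subset's kernel submatrix. The function names, the toy tags, and the quality scores are all hypothetical and chosen only for illustration.

```python
import numpy as np


def build_kernel(quality, features):
    """Assumed kernel form: L[i, j] = q_i * cos_sim(i, j) * q_j."""
    feats = features / np.linalg.norm(features, axis=1, keepdims=True)
    similarity = feats @ feats.T
    return quality[:, None] * similarity * quality[None, :]


def sequential_dpp_sample(kernel, k, rng):
    """Pick up to k items one at a time.

    Each candidate i is drawn with probability proportional to
    det(kernel restricted to chosen + [i]), so items that are
    redundant with those already chosen get (near-)zero probability.
    This is an illustrative approximation, not an exact k-DPP sampler.
    """
    n = kernel.shape[0]
    chosen = []
    for _ in range(k):
        gains = np.zeros(n)
        for i in range(n):
            if i in chosen:
                continue
            idx = chosen + [i]
            # Clip tiny negative values caused by floating-point error.
            gains[i] = max(np.linalg.det(kernel[np.ix_(idx, idx)]), 0.0)
        total = gains.sum()
        if total <= 0.0:
            break  # no remaining candidate adds any volume
        chosen.append(int(rng.choice(n, p=gains / total)))
    return chosen


# Toy example (hypothetical data): "person" and "people" share a feature
# vector, so they are semantically redundant.
tags = ["person", "people", "tree", "sky"]
quality = np.array([0.9, 0.8, 0.7, 0.6])      # assumed relevance scores
features = np.array([[1.0, 0.0, 0.0],
                     [1.0, 0.0, 0.0],
                     [0.0, 1.0, 0.0],
                     [0.0, 0.0, 1.0]])

kernel = build_kernel(quality, features)
subset = sequential_dpp_sample(kernel, k=3, rng=np.random.default_rng(0))
print([tags[i] for i in subset])
```

Calling the sampler with different random seeds yields different subsets, which loosely mirrors the abstract's point that perturbing the sampling process produces multiple diverse tag subsets. How D2IA actually perturbs the sampling and how the GAN training enters are not covered by this sketch.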

Cite

Text

Wu et al. "Tagging like Humans: Diverse and Distinct Image Annotation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018. doi:10.1109/CVPR.2018.00831

Markdown

[Wu et al. "Tagging like Humans: Diverse and Distinct Image Annotation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018.](https://mlanthology.org/cvpr/2018/wu2018cvpr-tagging/) doi:10.1109/CVPR.2018.00831

BibTeX

@inproceedings{wu2018cvpr-tagging,
  title     = {{Tagging like Humans: Diverse and Distinct Image Annotation}},
  author    = {Wu, Baoyuan and Chen, Weidong and Sun, Peng and Liu, Wei and Ghanem, Bernard and Lyu, Siwei},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2018},
  doi       = {10.1109/CVPR.2018.00831},
  url       = {https://mlanthology.org/cvpr/2018/wu2018cvpr-tagging/}
}