Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation

Abstract

To generate "accurate" scene graphs, almost all exist-ing methods predict pairwise relationships in a determin-istic manner. However, we argue that visual relationshipsare often semantically ambiguous. Specifically, inspired bylinguistic knowledge, we classify the ambiguity into threetypes: Synonymy Ambiguity, Hyponymy Ambiguity, andMulti-view Ambiguity. The ambiguity naturally leads to theissue ofimplicit multi-label, motivating the need for diversepredictions. In this work, we propose a novel plug-and-play Probabilistic Uncertainty Modeling (PUM) module. Itmodels each union region as a Gaussian distribution, whosevariance measures the uncertainty of the corresponding vi-sual content. Compared to the conventional determinis-tic methods, such uncertainty modeling brings stochasticityof feature representation, which naturally enables diversepredictions. As a byproduct, PUM also manages to covermore fine-grained relationships and thus alleviates the is-sue of bias towards frequent relationships. Extensive exper-iments on the large-scale Visual Genome benchmark showthat combining PUM with newly proposed ResCAGCN canachieve state-of-the-art performances, especially under themean recall metric. Furthermore, we show the universal ef-fectiveness of PUM by plugging it into some existing modelsand provide insightful analysis of its ability to generate di-verse yet plausible visual relationships.

Cite

Text

Yang et al. "Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.01234

Markdown

[Yang et al. "Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/yang2021cvpr-probabilistic/) doi:10.1109/CVPR46437.2021.01234

BibTeX

@inproceedings{yang2021cvpr-probabilistic,
  title     = {{Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation}},
  author    = {Yang, Gengcong and Zhang, Jingyi and Zhang, Yong and Wu, Baoyuan and Yang, Yujiu},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2021},
  pages     = {12527-12536},
  doi       = {10.1109/CVPR46437.2021.01234},
  url       = {https://mlanthology.org/cvpr/2021/yang2021cvpr-probabilistic/}
}