Graph R-CNN for Scene Graph Generation

Abstract

We propose a novel scene graph generation model called Graph R-CNN, that is both effective and efficient at detecting objects and their relations in images. Our model contains a Relation Proposal Network (RePN) that efficiently deals with the quadratic number of potential relations between objects in an image. We also propose an attentional Graph Convolutional Network (aGCN) that effectively captures contextual information between objects and relations. Finally, we introduce a novel scene graph evaluation metric that is more holistic and realistic than existing metrics. We report state-of-the-art performance on scene graph generation as evaluated using both existing and our proposed metrics.

Cite

Text

Yang et al. "Graph R-CNN for Scene Graph Generation." Proceedings of the European Conference on Computer Vision (ECCV), 2018. doi:10.1007/978-3-030-01246-5_41

Markdown

[Yang et al. "Graph R-CNN for Scene Graph Generation." Proceedings of the European Conference on Computer Vision (ECCV), 2018.](https://mlanthology.org/eccv/2018/yang2018eccv-graph/) doi:10.1007/978-3-030-01246-5_41

BibTeX

@inproceedings{yang2018eccv-graph,
  title     = {{Graph R-CNN for Scene Graph Generation}},
  author    = {Yang, Jianwei and Lu, Jiasen and Lee, Stefan and Batra, Dhruv and Parikh, Devi},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2018},
  doi       = {10.1007/978-3-030-01246-5_41},
  url       = {https://mlanthology.org/eccv/2018/yang2018eccv-graph/}
}