RA-SGG: Retrieval-Augmented Scene Graph Generation Framework via Multi-Prototype Learning

Yoon, Kanghoon; Kim, Kibum; Jeon, Jaehyeong; In, Yeonjun; Kim, Donghyun; Park, Chanyoung

doi:10.1609/AAAI.V39I9.33036

RA-SGG: Retrieval-Augmented Scene Graph Generation Framework via Multi-Prototype Learning

Kanghoon Yoon, Kibum Kim, Jaehyeong Jeon, Yeonjun In, Donghyun Kim, Chanyoung Park

AAAI 2025 pp. 9562-9570

doi:10.1609/AAAI.V39I9.33036 /aaai/2025/yoon2025aaai-ra/

Abstract

Scene Graph Generation (SGG) research has suffered from two fundamental challenges: the long-tailed predicate distribution and semantic ambiguity between predicates. These challenges lead to a bias towards head predicates in SGG models, favoring dominant general predicates while overlooking fine-grained predicates. In this paper, we address the challenges of SGG by framing it as multi-label classification problem with partial annotation, where relevant labels of fine-grained predicates are missing. Under the new frame, we propose Retrieval-Augmented Scene Graph Generation (RA-SGG), which identifies potential instances to be multilabeled and enriches the single-label with multi-labels that are semantically similar to the original label by retrieving relevant samples from our established memory bank. Based on augmented relations (i.e., discovered multi-labels), we apply multi-prototype learning to train our SGG model. Several comprehensive experiments have demonstrated that RASGG outperforms state-of-the-art baselines by up to 3.6% on VG and 5.9% on GQA, particularly in terms of F@K, showing that RA-SGG effectively alleviates the issue of biased prediction caused by the long-tailed distribution and semantic ambiguity of predicates.

PDF AAAI Semantic Scholar

Cite

Text

Yoon et al. "RA-SGG: Retrieval-Augmented Scene Graph Generation Framework via Multi-Prototype Learning." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I9.33036

Markdown

[Yoon et al. "RA-SGG: Retrieval-Augmented Scene Graph Generation Framework via Multi-Prototype Learning." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/yoon2025aaai-ra/) doi:10.1609/AAAI.V39I9.33036

BibTeX

@inproceedings{yoon2025aaai-ra,
  title     = {{RA-SGG: Retrieval-Augmented Scene Graph Generation Framework via Multi-Prototype Learning}},
  author    = {Yoon, Kanghoon and Kim, Kibum and Jeon, Jaehyeong and In, Yeonjun and Kim, Donghyun and Park, Chanyoung},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {9562-9570},
  doi       = {10.1609/AAAI.V39I9.33036},
  url       = {https://mlanthology.org/aaai/2025/yoon2025aaai-ra/}
}