Prototype-Guided Multimodal Relation Extraction Based on Entity Attributes

Abstract

Multimodal Relation Extraction (MRE) aims to predict relations between head and tail entities based on the context of sentence-image pairs. Most existing MRE methods progressively incorporate textual and visual inputs to dominate the learning process, assuming both contribute significantly to the task. However, the diverse visual appearances and text with ambiguous semantics contain less-informative contexts for the corresponding relation. To tackle these challenges, we highlight the importance of semantically invariant entity attributes that encompass fine-grained categories. Towards this, we propose a novel Prototype-Guided Multimodal Relation Extraction (PG-MRE) framework based on Entity Attributes. Specifically, we first generate detailed entity explanations using Large Language Models (LLMs) to supplement the attribute semantics. Then, the Attribute Prototype Module (APM) refines attribute categories and condenses scattered entity attribute features into cluster-level prototypes. Furthermore, prototype-aligned attribute features guide diverse visual appearance features to produce compact and distinctive multimodal representations in the Relation Prototype Module (RPM). Extensive experiments demonstrate that our method gains superior relation classification capability (especially in scenarios involving various unseen entities), achieving new state-of-the-art performances on MNRE dataset.

Cite

Text

Zhang et al. "Prototype-Guided Multimodal Relation Extraction Based on Entity Attributes." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I24.34795

Markdown

[Zhang et al. "Prototype-Guided Multimodal Relation Extraction Based on Entity Attributes." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/zhang2025aaai-prototype/) doi:10.1609/AAAI.V39I24.34795

BibTeX

@inproceedings{zhang2025aaai-prototype,
  title     = {{Prototype-Guided Multimodal Relation Extraction Based on Entity Attributes}},
  author    = {Zhang, Zefan and Zhang, Weiqi and Li, Yanhui and Bai, Tian},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {26003-26011},
  doi       = {10.1609/AAAI.V39I24.34795},
  url       = {https://mlanthology.org/aaai/2025/zhang2025aaai-prototype/}
}