Multi-Modal Graph Fusion for Named Entity Recognition with Targeted Visual Guidance

Abstract

Multi-modal named entity recognition (MNER) aims to discover named entities in free text and classify them into pre-defined types with images. However, dominant MNER models do not fully exploit fine-grained semantic correspondences between semantic units of different modalities, which have the potential to refine multi-modal representation learning. To deal with this issue, we propose a unified multi-modal graph fusion (UMGF) approach for MNER. Specifically, we first represent the input sentence and image using a unified multi-modal graph, which captures various semantic relationships between multi-modal semantic units (words and visual objects). Then, we stack multiple graph-based multi-modal fusion layers that iteratively perform semantic interactions to learn node representations. Finally, we achieve an attention-based multi-modal representation for each word and perform entity labeling with a CRF decoder. Experimentation on the two benchmark datasets demonstrates the superiority of our MNER model.

Cite

Text

Zhang et al. "Multi-Modal Graph Fusion for Named Entity Recognition with Targeted Visual Guidance." AAAI Conference on Artificial Intelligence, 2021. doi:10.1609/AAAI.V35I16.17687

Markdown

[Zhang et al. "Multi-Modal Graph Fusion for Named Entity Recognition with Targeted Visual Guidance." AAAI Conference on Artificial Intelligence, 2021.](https://mlanthology.org/aaai/2021/zhang2021aaai-multi-a/) doi:10.1609/AAAI.V35I16.17687

BibTeX

@inproceedings{zhang2021aaai-multi-a,
  title     = {{Multi-Modal Graph Fusion for Named Entity Recognition with Targeted Visual Guidance}},
  author    = {Zhang, Dong and Wei, Suzhong and Li, Shoushan and Wu, Hanqian and Zhu, Qiaoming and Zhou, Guodong},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {14347-14355},
  doi       = {10.1609/AAAI.V35I16.17687},
  url       = {https://mlanthology.org/aaai/2021/zhang2021aaai-multi-a/}
}