An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction

Abstract

In this paper, we propose a novel method for joint entity and relation extract from unstructured text by framing it as a conditional sequence generation problem. In contrast to conventional generative information extraction models that generate text as output, our approach generates a linearized graph where nodes represent text spans while the edges/relation of the graph represent relation triples. For that, our method employs a transformer encoder-decoder architecture with pointing mechanism on a dynamic vocabulary of spans and relation types. Particularly, our model can capture the structural characteristics and boundaries of entities and relations through span representation, while simultaneously grounding the generated output in the original text thanks to pointer mechanism. Evaluation on benchmark datasets validates the effectiveness of our approach, demonstrating state-of-the-art results in entity and relation extraction tasks.

Cite

Text

Zaratiana et al. "An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction." ICML 2023 Workshops: SPIGM, 2023.

Markdown

[Zaratiana et al. "An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction." ICML 2023 Workshops: SPIGM, 2023.](https://mlanthology.org/icmlw/2023/zaratiana2023icmlw-autoregressive/)

BibTeX

@inproceedings{zaratiana2023icmlw-autoregressive,
  title     = {{An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction}},
  author    = {Zaratiana, Urchade and Tomeh, Nadi and Holat, Pierre and Charnois, Thierry},
  booktitle = {ICML 2023 Workshops: SPIGM},
  year      = {2023},
  url       = {https://mlanthology.org/icmlw/2023/zaratiana2023icmlw-autoregressive/}
}