Span Model for Open Information Extraction on Accurate Corpus

Zhan, Junlang; Zhao, Hai

doi:10.1609/AAAI.V34I05.6497

Span Model for Open Information Extraction on Accurate Corpus

Junlang Zhan, Hai Zhao

AAAI 2020 pp. 9523-9530

doi:10.1609/AAAI.V34I05.6497 /aaai/2020/zhan2020aaai-span/

Abstract

Open information extraction (Open IE) is a challenging task especially due to its brittle data basis. Most of Open IE systems have to be trained on automatically built corpus and evaluated on inaccurate test set. In this work, we first alleviate this difficulty from both sides of training and test sets. For the former, we propose an improved model design to more sufficiently exploit training dataset. For the latter, we present our accurately re-annotated benchmark test set (Re-OIE6) according to a series of linguistic observation and analysis. Then, we introduce a span model instead of previous adopted sequence labeling formulization for n-ary Open IE. Our newly introduced model achieves new state-of-the-art performance on both benchmark evaluation datasets.

PDF AAAI Semantic Scholar

Cite

Text

Zhan and Zhao. "Span Model for Open Information Extraction on Accurate Corpus." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I05.6497

Markdown

[Zhan and Zhao. "Span Model for Open Information Extraction on Accurate Corpus." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/zhan2020aaai-span/) doi:10.1609/AAAI.V34I05.6497

BibTeX

@inproceedings{zhan2020aaai-span,
  title     = {{Span Model for Open Information Extraction on Accurate Corpus}},
  author    = {Zhan, Junlang and Zhao, Hai},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {9523-9530},
  doi       = {10.1609/AAAI.V34I05.6497},
  url       = {https://mlanthology.org/aaai/2020/zhan2020aaai-span/}
}