CrossNER: Evaluating Cross-Domain Named Entity Recognition

Liu, Zihan; Xu, Yan; Yu, Tiezheng; Dai, Wenliang; Ji, Ziwei; Cahyawijaya, Samuel; Madotto, Andrea; Fung, Pascale

doi:10.1609/AAAI.V35I15.17587

AAAI 2021 pp. 13452-13460

Abstract

Cross-domain named entity recognition (NER) models are able to cope with the scarcity issue of NER samples in target domains. However, most of the existing NER benchmarks lack domain-specialized entity types or do not focus on a certain domain, leading to a less effective cross-domain evaluation. To address these obstacles, we introduce a cross-domain NER dataset (CrossNER), a fully-labeled collection of NER data spanning over five diverse domains with specialized entity categories for different domains. Additionally, we also provide a domain-related corpus since using it to continue pre-training language models (domain-adaptive pre-training) is effective for the domain adaptation. We then conduct comprehensive experiments to explore the effectiveness of leveraging different levels of the domain corpus and pre-training strategies to do domain-adaptive pre-training for the cross-domain task. Results show that focusing on the fractional corpus containing domain-specialized entities and utilizing a more challenging pre-training strategy in domain-adaptive pre-training are beneficial for the NER domain adaptation, and our proposed method can consistently outperform existing cross-domain NER baselines. Nevertheless, experiments also illustrate the challenge of this cross-domain NER task. We hope that our dataset and baselines will catalyze research in the NER domain adaptation area. The code and data are available at https://github.com/zliucr/CrossNER.

Cite

Text

Liu et al. "CrossNER: Evaluating Cross-Domain Named Entity Recognition." AAAI Conference on Artificial Intelligence, 2021. doi:10.1609/AAAI.V35I15.17587

Markdown

[Liu et al. "CrossNER: Evaluating Cross-Domain Named Entity Recognition." AAAI Conference on Artificial Intelligence, 2021.](https://mlanthology.org/aaai/2021/liu2021aaai-crossner/) doi:10.1609/AAAI.V35I15.17587

BibTeX

@inproceedings{liu2021aaai-crossner,
  title     = {{CrossNER: Evaluating Cross-Domain Named Entity Recognition}},
  author    = {Liu, Zihan and Xu, Yan and Yu, Tiezheng and Dai, Wenliang and Ji, Ziwei and Cahyawijaya, Samuel and Madotto, Andrea and Fung, Pascale},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {13452--13460},
  doi       = {10.1609/AAAI.V35I15.17587},
  url       = {https://mlanthology.org/aaai/2021/liu2021aaai-crossner/}
}