EASAL: Entity-Aware Subsequence-Based Active Learning for Named Entity Recognition

Abstract

Active learning is a critical technique for reducing the labelling load by selecting the most informative data for annotation. Most previous work applies active learning to Named Entity Recognition (a token-level task) in the same way as to text classification (a sentence-level task). Such approaches fail to consider the heterogeneity of uncertainty within each sentence and require the annotator to access the entire sentence when labelling. To overcome these limitations, we allow the active learning algorithm to query subsequences within sentences and propose Entity-Aware Subsequence-Based Active Learning (EASAL), which uses an effective BERT-based Head-Tail pointer to query one entity-aware subsequence per sentence. Of the tokens outside this subsequence, we randomly select 30% to be pseudo-labelled, with the model directly predicting their pseudo-labels, and train on them together with the queried subsequence. Experimental results on both news and biomedical datasets demonstrate the effectiveness of the proposed method. The code is released at https://github.com/lylylylylyly/EASAL.
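The labelling scheme described in the abstract can be illustrated with a minimal sketch. It is not taken from the released code: the function name `build_training_targets`, its parameters, the rounding of the 30% share, and the choice to mask out the remaining tokens during training are all assumptions made for illustration. The queried span `[head, tail]` is taken as given here; the Head-Tail pointer that produces it is not modelled.

```python
import random


def build_training_targets(predicted_labels, head, tail, oracle_labels,
                           pseudo_ratio=0.3, seed=0):
    """Build per-token training labels for one sentence (illustrative only).

    Tokens in the queried span [head, tail] (inclusive) take the annotator's
    labels. A random `pseudo_ratio` share of the tokens outside the span take
    the model's own predictions as pseudo-labels. All other tokens are masked
    out, which is an assumption of this sketch.
    """
    n = len(predicted_labels)
    rng = random.Random(seed)

    # Indices of tokens that the annotator does not see.
    outside = [i for i in range(n) if i < head or i > tail]
    # How the 30% share is rounded is an assumption.
    k = round(pseudo_ratio * len(outside))
    pseudo = set(rng.sample(outside, k))

    labels, mask = [], []
    for i in range(n):
        if head <= i <= tail:
            labels.append(oracle_labels[i - head])  # human label
            mask.append(True)
        elif i in pseudo:
            labels.append(predicted_labels[i])      # pseudo-label
            mask.append(True)
        else:
            labels.append(None)                     # ignored in training
            mask.append(False)
    return labels, mask


# Hypothetical 12-token sentence; the queried span covers tokens 2..3.
predicted = ["O", "O", "B-PER", "O", "O", "O", "O", "O", "O", "O", "O", "O"]
oracle = ["B-PER", "I-PER"]
labels, mask = build_training_targets(predicted, head=2, tail=3,
                                      oracle_labels=oracle)
print(labels)
print(mask)
```

With 10 tokens outside the span, 3 of them receive pseudo-labels, so 5 of the 12 tokens contribute to training in this sketch.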

Cite

Text

Liu et al. "EASAL: Entity-Aware Subsequence-Based Active Learning for Named Entity Recognition." AAAI Conference on Artificial Intelligence, 2023. doi:10.1609/AAAI.V37I7.26069

Markdown

[Liu et al. "EASAL: Entity-Aware Subsequence-Based Active Learning for Named Entity Recognition." AAAI Conference on Artificial Intelligence, 2023.](https://mlanthology.org/aaai/2023/liu2023aaai-easal/) doi:10.1609/AAAI.V37I7.26069

BibTeX

@inproceedings{liu2023aaai-easal,
  title     = {{EASAL: Entity-Aware Subsequence-Based Active Learning for Named Entity Recognition}},
  author    = {Liu, Yang and Hu, Jinpeng and Chen, Zhihong and Wan, Xiang and Chang, Tsung-Hui},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {8897--8905},
  doi       = {10.1609/AAAI.V37I7.26069},
  url       = {https://mlanthology.org/aaai/2023/liu2023aaai-easal/}
}