Transition-Based Neural Word Segmentation Using Word-Level Features

Zhang, Meishan; Zhang, Yue; Fu, Guohong

doi:10.1613/JAIR.1.11266

Transition-Based Neural Word Segmentation Using Word-Level Features

Meishan Zhang, Yue Zhang, Guohong Fu

JAIR 2018 pp. 923-953

doi:10.1613/JAIR.1.11266 /jair/2018/zhang2018jair-transitionbased/

Abstract

   Character-based and word-based methods are two different solutions for Chinese word segmentation, the former exploiting sequence labeling models over characters and the latter using word-level features. Neural models have been exploited for character-based Chinese word segmentation, giving high accuracies by making use of external character embeddings, yet requiring less feature engineering. In this paper, we study a neural model for word-based Chinese word segmentation, by replacing the manually-designed discrete features with neural features in a transition-based word segmentation framework. Experimental results demonstrate that word features lead to comparable performance to the best systems in the literature, and a further combination of discrete and neural features obtains top accuracies on several benchmarks.  

PDF JAIR Semantic Scholar

Cite

Text

Zhang et al. "Transition-Based Neural Word Segmentation Using Word-Level Features." Journal of Artificial Intelligence Research, 2018. doi:10.1613/JAIR.1.11266

Markdown

[Zhang et al. "Transition-Based Neural Word Segmentation Using Word-Level Features." Journal of Artificial Intelligence Research, 2018.](https://mlanthology.org/jair/2018/zhang2018jair-transitionbased/) doi:10.1613/JAIR.1.11266

BibTeX

@article{zhang2018jair-transitionbased,
  title     = {{Transition-Based Neural Word Segmentation Using Word-Level Features}},
  author    = {Zhang, Meishan and Zhang, Yue and Fu, Guohong},
  journal   = {Journal of Artificial Intelligence Research},
  year      = {2018},
  pages     = {923-953},
  doi       = {10.1613/JAIR.1.11266},
  volume    = {63},
  url       = {https://mlanthology.org/jair/2018/zhang2018jair-transitionbased/}
}