Transition-Based Neural Word Segmentation Using Word-Level Features
Abstract

 
 
 Character-based and word-based methods are two different solutions for Chinese word segmentation, the former exploiting sequence labeling models over characters and the latter using word-level features. Neural models have been exploited for character-based Chinese word segmentation, giving high accuracies by making use of external character embeddings, yet requiring less feature engineering. In this paper, we study a neural model for word-based Chinese word segmentation, by replacing the manually-designed discrete features with neural features in a transition-based word segmentation framework. Experimental results demonstrate that word features lead to comparable performance to the best systems in the literature, and a further combination of discrete and neural features obtains top accuracies on several benchmarks.
 
 
Cite
Text
Zhang et al. "Transition-Based Neural Word Segmentation Using Word-Level Features." Journal of Artificial Intelligence Research, 2018. doi:10.1613/JAIR.1.11266Markdown
[Zhang et al. "Transition-Based Neural Word Segmentation Using Word-Level Features." Journal of Artificial Intelligence Research, 2018.](https://mlanthology.org/jair/2018/zhang2018jair-transitionbased/) doi:10.1613/JAIR.1.11266BibTeX
@article{zhang2018jair-transitionbased,
title = {{Transition-Based Neural Word Segmentation Using Word-Level Features}},
author = {Zhang, Meishan and Zhang, Yue and Fu, Guohong},
journal = {Journal of Artificial Intelligence Research},
year = {2018},
pages = {923-953},
doi = {10.1613/JAIR.1.11266},
volume = {63},
url = {https://mlanthology.org/jair/2018/zhang2018jair-transitionbased/}
}