Semi-Markov Conditional Random Fields for Information Extraction

Abstract

We describe semi-Markov conditional random fields (semi-CRFs), a con- ditionally trained version of semi-Markov chains. Intuitively, a semi- CRF on an input sequence x outputs a “segmentation” of x, in which labels are assigned to segments (i.e., subsequences) of x rather than to individual elements xi of x. Importantly, features for semi-CRFs can measure properties of segments, and transitions within a segment can be non-Markovian. In spite of this additional power, exact learning and inference algorithms for semi-CRFs are polynomial-time—often only a small constant factor slower than conventional CRFs. In experiments on five named entity recognition problems, semi-CRFs generally outper- form conventional CRFs.

Cite

Text

Sarawagi and Cohen. "Semi-Markov Conditional Random Fields for Information Extraction." Neural Information Processing Systems, 2004.

Markdown

[Sarawagi and Cohen. "Semi-Markov Conditional Random Fields for Information Extraction." Neural Information Processing Systems, 2004.](https://mlanthology.org/neurips/2004/sarawagi2004neurips-semimarkov/)

BibTeX

@inproceedings{sarawagi2004neurips-semimarkov,
  title     = {{Semi-Markov Conditional Random Fields for Information Extraction}},
  author    = {Sarawagi, Sunita and Cohen, William W.},
  booktitle = {Neural Information Processing Systems},
  year      = {2004},
  pages     = {1185-1192},
  url       = {https://mlanthology.org/neurips/2004/sarawagi2004neurips-semimarkov/}
}