Fast Newton-CG Method for Batch Learning of Conditional Random Fields

Abstract

We propose a fast batch learning method for linear-chain Conditional Random Fields (CRFs) based on Newton-CG methods. Newton-CG methods are a variant of Newton method for high-dimensional problems. They only require the Hessian-vector products instead of the full Hessian matrices. To speed up Newton-CG methods for the CRF learning, we derive a novel dynamic programming procedure for the Hessian-vector products of the CRF objective function. The proposed procedure can reuse the byproducts of the time-consuming gradient computation for the Hessian-vector products to drastically reduce the total computation time of the Newton-CG methods. In experiments with tasks in natural language processing, the proposed method outperforms a conventional quasi-Newton method. Remarkably, the proposed method is competitive with online learning algorithms that are fast but unstable.

Cite

Text

Tsuboi et al. "Fast Newton-CG Method for Batch Learning of Conditional Random Fields." AAAI Conference on Artificial Intelligence, 2011. doi:10.1609/AAAI.V25I1.7894

Markdown

[Tsuboi et al. "Fast Newton-CG Method for Batch Learning of Conditional Random Fields." AAAI Conference on Artificial Intelligence, 2011.](https://mlanthology.org/aaai/2011/tsuboi2011aaai-fast/) doi:10.1609/AAAI.V25I1.7894

BibTeX

@inproceedings{tsuboi2011aaai-fast,
  title     = {{Fast Newton-CG Method for Batch Learning of Conditional Random Fields}},
  author    = {Tsuboi, Yuta and Unno, Yuya and Kashima, Hisashi and Okazaki, Naoaki},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2011},
  pages     = {489-494},
  doi       = {10.1609/AAAI.V25I1.7894},
  url       = {https://mlanthology.org/aaai/2011/tsuboi2011aaai-fast/}
}