Spectral Learning of General Weighted Automata via Constrained Matrix Completion

Abstract

Many tasks in text and speech processing and computational biology require es- timating functions mapping strings to real numbers. A broad class of such func- tions can be defined by weighted automata. Spectral methods based on the sin- gular value decomposition of a Hankel matrix have been recently proposed for learning a probability distribution represented by a weighted automaton from a training sample drawn according to this same target distribution. In this paper, we show how spectral methods can be extended to the problem of learning a general weighted automaton from a sample generated by an arbitrary distribution. The main obstruction to this approach is that, in general, some entries of the Hankel matrix may be missing. We present a solution to this problem based on solving a constrained matrix completion problem. Combining these two ingredients, matrix completion and spectral method, a whole new family of algorithms for learning general weighted automata is obtained. We present generalization bounds for a particular algorithm in this family. The proofs rely on a joint stability analysis of matrix completion and spectral learning.

Cite

Text

Balle and Mohri. "Spectral Learning of General Weighted Automata via Constrained Matrix Completion." Neural Information Processing Systems, 2012.

Markdown

[Balle and Mohri. "Spectral Learning of General Weighted Automata via Constrained Matrix Completion." Neural Information Processing Systems, 2012.](https://mlanthology.org/neurips/2012/balle2012neurips-spectral/)

BibTeX

@inproceedings{balle2012neurips-spectral,
  title     = {{Spectral Learning of General Weighted Automata via Constrained Matrix Completion}},
  author    = {Balle, Borja and Mohri, Mehryar},
  booktitle = {Neural Information Processing Systems},
  year      = {2012},
  pages     = {2159-2167},
  url       = {https://mlanthology.org/neurips/2012/balle2012neurips-spectral/}
}