Learning Languages with Rational Kernels

Cortes, Corinna; Kontorovich, Leonid; Mohri, Mehryar

doi:10.1007/978-3-540-72927-3_26

Learning Languages with Rational Kernels

Corinna Cortes, Leonid Kontorovich, Mehryar Mohri

COLT 2007 pp. 349-364

doi:10.1007/978-3-540-72927-3_26 /colt/2007/cortes2007colt-learning/

Abstract

We present a general study of learning and linear separability with rational kernels, the sequence kernels commonly used in computational biology and natural language processing. We give a characterization of the class of all languages linearly separable with rational kernels and prove several properties of the class of languages linearly separable with a fixed rational kernel. In particular, we show that for kernels with transducer values in a finite set, these languages are necessarily finite Boolean combinations of preimages by a transducer of a single sequence. We also analyze the margin properties of linear separation with rational kernels and show that kernels with transducer values in a finite set guarantee a positive margin and lead to better learning guarantees. Creating a rational kernel with values in a finite set is often non-trivial even for relatively simple cases. However, we present a novel and general algorithm, double-tape disambiguation, that takes as input a transducer mapping sequences to sequence features, and yields an associated transducer that defines a finite range rational kernel. We describe the algorithm in detail and show its application to several cases of interest.

PDF COLT Semantic Scholar

Cite

Text

Cortes et al. "Learning Languages with Rational Kernels." Annual Conference on Computational Learning Theory, 2007. doi:10.1007/978-3-540-72927-3_26

Markdown

[Cortes et al. "Learning Languages with Rational Kernels." Annual Conference on Computational Learning Theory, 2007.](https://mlanthology.org/colt/2007/cortes2007colt-learning/) doi:10.1007/978-3-540-72927-3_26

BibTeX

@inproceedings{cortes2007colt-learning,
  title     = {{Learning Languages with Rational Kernels}},
  author    = {Cortes, Corinna and Kontorovich, Leonid and Mohri, Mehryar},
  booktitle = {Annual Conference on Computational Learning Theory},
  year      = {2007},
  pages     = {349-364},
  doi       = {10.1007/978-3-540-72927-3_26},
  url       = {https://mlanthology.org/colt/2007/cortes2007colt-learning/}
}