Languages as Hyperplanes: Grammatical Inference with String Kernels

Abstract

Using string kernels, languages can be represented as hyperplanes in a high dimensional feature space. We discuss the language-theoretic properties of this formalism with particular reference to the implicit feature maps defined by string kernels, considering the expressive power of the formalism, its closure properties and its relationship to other formalisms. We present a new family of grammatical inference algorithms based on this idea. We demonstrate that some mildly context-sensitive languages can be represented in this way and that it is possible to efficiently learn these using kernel PCA. We experimentally demonstrate the effectiveness of this approach on some standard examples of context-sensitive languages using small synthetic data sets.

Cite

Text

Clark et al. "Languages as Hyperplanes: Grammatical Inference with String Kernels." Machine Learning, 2011. doi:10.1007/S10994-010-5218-3

Markdown

[Clark et al. "Languages as Hyperplanes: Grammatical Inference with String Kernels." Machine Learning, 2011.](https://mlanthology.org/mlj/2011/clark2011mlj-languages/) doi:10.1007/S10994-010-5218-3

BibTeX

@article{clark2011mlj-languages,
  title     = {{Languages as Hyperplanes: Grammatical Inference with String Kernels}},
  author    = {Clark, Alexander and Florêncio, Christophe Costa and Watkins, Chris},
  journal   = {Machine Learning},
  year      = {2011},
  pages     = {351-373},
  doi       = {10.1007/S10994-010-5218-3},
  volume    = {82},
  url       = {https://mlanthology.org/mlj/2011/clark2011mlj-languages/}
}