Identification with Probability One of Stochastic Deterministic Linear Languages

Abstract

Learning context-free grammars is generally considered a very hard task. This is even more the case when learning has to be done from positive examples only. In this context one possibility is to learn stochastic context-free grammars, by making the implicit assumption that the distribution of the examples is given by such an object. Nevertheless this is still a hard task for which no algorithm is known. We use recent results to introduce a proper subclass of linear grammars, called deterministic linear grammars, for which we prove that a small canonical form can be found. This has been a successful condition for a learning algorithm to be possible. We propose an algorithm for this class of grammars and we prove that our algorithm works in polynomial time, and structurally converges to the target in the paradigm of identification in the limit with probability 1. Although this does not ensure that only a polynomial size sample is necessary for learning to be possible, we argue that the criterion means that no added (hidden) bias is present.

Cite

Text

de la Higuera and Oncina. "Identification with Probability One of Stochastic Deterministic Linear Languages." International Conference on Algorithmic Learning Theory, 2003. doi:10.1007/978-3-540-39624-6_20

Markdown

[de la Higuera and Oncina. "Identification with Probability One of Stochastic Deterministic Linear Languages." International Conference on Algorithmic Learning Theory, 2003.](https://mlanthology.org/alt/2003/delahiguera2003alt-identification/) doi:10.1007/978-3-540-39624-6_20

BibTeX

@inproceedings{delahiguera2003alt-identification,
  title     = {{Identification with Probability One of Stochastic Deterministic Linear Languages}},
  author    = {de la Higuera, Colin and Oncina, José},
  booktitle = {International Conference on Algorithmic Learning Theory},
  year      = {2003},
  pages     = {247-258},
  doi       = {10.1007/978-3-540-39624-6_20},
  url       = {https://mlanthology.org/alt/2003/delahiguera2003alt-identification/}
}