Probabilistic Generalization of Simple Grammars and Its Application to Reinforcement Learning

Abstract

Recently, some non-regular subclasses of context-free grammars have been found to be efficiently learnable from positive data. In order to use these efficient algorithms to infer probabilistic languages, one must take into account not only equivalences between languages but also probabilistic generalities of grammars. The probabilistic generality of a grammar G is the class of the probabilistic languages generated by probabilistic grammars constructed on G. We introduce a subclass of simple grammars (SGs), referred to as unifiable simple grammars (USGs), which is a superclass of an efficiently learnable class, right-unique simple grammars (RSGs). We show that the class of RSGs is unifiable within the class of USGs, whereas SGs and RSGs are not unifiable within the class of SGs and RSGs, respectively. We also introduce simple context-free decision processes, which are a natural extension of finite Markov decision processes and may intuitively be thought of as Markov decision processes with stacks. We propose a reinforcement learning method on simple context-free decision processes, as an application of the learning and unification algorithm for RSGs from positive data.

Cite

Text

Shibata et al. "Probabilistic Generalization of Simple Grammars and Its Application to Reinforcement Learning." International Conference on Algorithmic Learning Theory, 2006. doi:10.1007/11894841_28

Markdown

[Shibata et al. "Probabilistic Generalization of Simple Grammars and Its Application to Reinforcement Learning." International Conference on Algorithmic Learning Theory, 2006.](https://mlanthology.org/alt/2006/shibata2006alt-probabilistic/) doi:10.1007/11894841_28

BibTeX

@inproceedings{shibata2006alt-probabilistic,
  title     = {{Probabilistic Generalization of Simple Grammars and Its Application to Reinforcement Learning}},
  author    = {Shibata, Takeshi and Yoshinaka, Ryo and Chikayama, Takashi},
  booktitle = {International Conference on Algorithmic Learning Theory},
  year      = {2006},
  pages     = {348--362},
  doi       = {10.1007/11894841_28},
  url       = {https://mlanthology.org/alt/2006/shibata2006alt-probabilistic/}
}