Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System

Abstract

Designing the dialogue policy of a spoken dialogue system involves many nontrivial choices. This paper presents a reinforcement learning approach for automatically optimizing a dialogue policy, which addresses the technical challenges in applying reinforcement learning to a working dialogue system with human users. We report on the design, construction, and empirical evaluation of NJFun, an experimental spoken dialogue system that provides users with access to information about fun things to do in New Jersey. Our results show that by optimizing its dialogue policy via reinforcement learning, NJFun measurably improves system performance.
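As a rough illustration of the approach the abstract describes, the sketch below builds an empirical MDP from logged dialogue transitions and solves it by value iteration to obtain a greedy dialogue policy. All state names, actions, and reward numbers here are invented for illustration; they are not NJFun's actual state space or data.

```python
from collections import defaultdict

# Hypothetical logged transitions: (state, action, next_state, reward).
# In a real system these would come from exploratory dialogues with users.
log = (
    [("greet", "system_init", "done", 1.0)] * 8 +
    [("greet", "system_init", "done", 0.0)] * 2 +
    [("greet", "user_init", "done", 1.0)] * 3 +
    [("greet", "user_init", "done", 0.0)] * 7
)

def build_mdp(transitions):
    """Estimate transition probabilities and mean rewards from logged data."""
    counts = defaultdict(lambda: defaultdict(int))
    reward_sum = defaultdict(float)
    totals = defaultdict(int)
    for s, a, s2, r in transitions:
        counts[(s, a)][s2] += 1
        reward_sum[(s, a)] += r
        totals[(s, a)] += 1
    probs = {sa: {s2: c / totals[sa] for s2, c in nxt.items()}
             for sa, nxt in counts.items()}
    mean_r = {sa: reward_sum[sa] / totals[sa] for sa in totals}
    return probs, mean_r

def value_iteration(probs, mean_r, terminal, gamma=0.9, iters=100):
    """Compute Q-values on the empirical MDP and extract a greedy policy."""
    states = {s for s, _ in probs}
    q = defaultdict(float)
    for _ in range(iters):
        v = {s: max(q[(s, a)] for (s2, a) in probs if s2 == s)
             for s in states}
        for (s, a), nxt in probs.items():
            q[(s, a)] = mean_r[(s, a)] + gamma * sum(
                p * (0.0 if s2 in terminal else v[s2])
                for s2, p in nxt.items())
    policy = {s: max((a for (s2, a) in probs if s2 == s),
                     key=lambda a: q[(s, a)]) for s in states}
    return q, policy

probs, mean_r = build_mdp(log)
q, policy = value_iteration(probs, mean_r, terminal={"done"})
print(policy["greet"])  # system_init (higher empirical mean reward)
```

In this toy example the system-initiative action succeeds more often in the logs, so the learned policy prefers it; the paper's experiments optimize over a much larger space of initiative and confirmation choices.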

Cite

Text

Singh et al. "Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System." Journal of Artificial Intelligence Research, 2002. doi:10.1613/JAIR.859

Markdown

[Singh et al. "Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System." Journal of Artificial Intelligence Research, 2002.](https://mlanthology.org/jair/2002/singh2002jair-optimizing/) doi:10.1613/JAIR.859

BibTeX

@article{singh2002jair-optimizing,
  title     = {{Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System}},
  author    = {Singh, Satinder and Litman, Diane J. and Kearns, Michael J. and Walker, Marilyn A.},
  journal   = {Journal of Artificial Intelligence Research},
  year      = {2002},
  volume    = {16},
  pages     = {105--133},
  doi       = {10.1613/JAIR.859},
  url       = {https://mlanthology.org/jair/2002/singh2002jair-optimizing/}
}