Universal Post-Processing Networks for Joint Optimization of Modules in Task-Oriented Dialogue Systems

Ohashi, Atsumoto; Higashinaka, Ryuichiro

doi:10.1609/AAAI.V39I23.34681

Universal Post-Processing Networks for Joint Optimization of Modules in Task-Oriented Dialogue Systems

Atsumoto Ohashi, Ryuichiro Higashinaka

AAAI 2025 pp. 24975-24983

doi:10.1609/AAAI.V39I23.34681 /aaai/2025/ohashi2025aaai-universal/

Abstract

Post-processing networks (PPNs) are components that modify the outputs of arbitrary modules in task-oriented dialogue systems and are optimized using reinforcement learning (RL) to improve the overall task completion capability of the system. However, previous PPN-based approaches have been limited to handling only a subset of modules within a system, which poses a significant limitation in improving the system performance. In this study, we propose a joint optimization method for post-processing the outputs of all modules using universal post-processing networks (UniPPNs), which are language-model-based networks that can modify the outputs of arbitrary modules in a system as a sequence-transformation task. Moreover, our RL algorithm, which employs a module-level Markov decision process, enables fine-grained value and advantage estimation for each module, thereby stabilizing joint learning for post-processing the outputs of all modules. Through both simulation-based and human evaluation experiments using the MultiWOZ dataset, we demonstrated that UniPPN outperforms conventional PPNs in the task completion capability of task-oriented dialogue systems.

PDF AAAI Semantic Scholar

Cite

Text

Ohashi and Higashinaka. "Universal Post-Processing Networks for Joint Optimization of Modules in Task-Oriented Dialogue Systems." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I23.34681

Markdown

[Ohashi and Higashinaka. "Universal Post-Processing Networks for Joint Optimization of Modules in Task-Oriented Dialogue Systems." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/ohashi2025aaai-universal/) doi:10.1609/AAAI.V39I23.34681

BibTeX

@inproceedings{ohashi2025aaai-universal,
  title     = {{Universal Post-Processing Networks for Joint Optimization of Modules in Task-Oriented Dialogue Systems}},
  author    = {Ohashi, Atsumoto and Higashinaka, Ryuichiro},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {24975-24983},
  doi       = {10.1609/AAAI.V39I23.34681},
  url       = {https://mlanthology.org/aaai/2025/ohashi2025aaai-universal/}
}