Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation

Gao, Yang; Meyer, Christian M.; Mesgar, Mohsen; Gurevych, Iryna

doi:10.24963/IJCAI.2019/326

Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation

Yang Gao, Christian M. Meyer, Mohsen Mesgar, Iryna Gurevych

IJCAI 2019 pp. 2350-2356

doi:10.24963/IJCAI.2019/326 /ijcai/2019/gao2019ijcai-reward/

Abstract

Document summarisation can be formulated as a sequential decision-making problem, which can be solved by Reinforcement Learning (RL) algorithms. The predominant RL paradigm for summarisation learns a cross-input policy, which requires considerable time, data and parameter tuning due to the huge search spaces and the delayed rewards. Learning input-specific RL policies is a more efficient alternative, but so far depends on handcrafted rewards, which are difficult to design and yield poor performance. We propose RELIS, a novel RL paradigm that learns a reward function with Learning-to-Rank (L2R) algorithms at training time and uses this reward function to train an input-specific RL policy at test time. We prove that RELIS guarantees to generate near-optimal summaries with appropriate L2R and RL algorithms. Empirically, we evaluate our approach on extractive multi-document summarisation. We show that RELIS reduces the training time by two orders of magnitude compared to the state-of-the-art models while performing on par with them.

PDF IJCAI Semantic Scholar

Cite

Text

Gao et al. "Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation." International Joint Conference on Artificial Intelligence, 2019. doi:10.24963/IJCAI.2019/326

Markdown

[Gao et al. "Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation." International Joint Conference on Artificial Intelligence, 2019.](https://mlanthology.org/ijcai/2019/gao2019ijcai-reward/) doi:10.24963/IJCAI.2019/326

BibTeX

@inproceedings{gao2019ijcai-reward,
  title     = {{Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation}},
  author    = {Gao, Yang and Meyer, Christian M. and Mesgar, Mohsen and Gurevych, Iryna},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2019},
  pages     = {2350-2356},
  doi       = {10.24963/IJCAI.2019/326},
  url       = {https://mlanthology.org/ijcai/2019/gao2019ijcai-reward/}
}