SALAS: Supervised Aspect Learning Improves Abstractive Multi-Document Summarization Through Aspect Information Loss
Abstract
Abstractive multi-document summarization (MDS) aims at summarizing and paraphrasing the salient key information in multiple documents. For dealing with the long-input issue brought by multiple documents, most previous work extracts salient sentence-level information from the input documents and then performs summarizing on the extracted information. However, the aspects of documents are neglected. The limited ability to discover the content on certain aspects hampers the key information seeking and ruins the comprehensiveness of the generated summaries. To solve the issue, we propose a novel S upervised A spect- L earning A bstractive S ummarization framework (SALAS) and a new aspect information loss (AILoss) to learn aspect information to supervise the generating process heuristically. Specifically, SALAS adopts three probes to capture aspect information as both constraints of the objective function and supplement information to be expressed in the representations. Aspect information is explicitly discovered and exploited to facilitate generating comprehensive summaries by AILoss. We conduct extensive experiments on three public datasets. The experimental results demonstrate that SALAS outperforms previous state-of-the-art (SOTA) baselines, achieving a new SOTA performance on the three MDS datasets. We make our code for SALAS publicly available ( https://github.com/Hytn/AspectSum ).
Cite
Text
Chen et al. "SALAS: Supervised Aspect Learning Improves Abstractive Multi-Document Summarization Through Aspect Information Loss." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2023. doi:10.1007/978-3-031-43421-1_4Markdown
[Chen et al. "SALAS: Supervised Aspect Learning Improves Abstractive Multi-Document Summarization Through Aspect Information Loss." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2023.](https://mlanthology.org/ecmlpkdd/2023/chen2023ecmlpkdd-salas/) doi:10.1007/978-3-031-43421-1_4BibTeX
@inproceedings{chen2023ecmlpkdd-salas,
title = {{SALAS: Supervised Aspect Learning Improves Abstractive Multi-Document Summarization Through Aspect Information Loss}},
author = {Chen, Haotian and Zhang, Han and Guo, Houjing and Yi, Shuchang and Chen, Bingsheng and Zhou, Xiangdong},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2023},
pages = {55-70},
doi = {10.1007/978-3-031-43421-1_4},
url = {https://mlanthology.org/ecmlpkdd/2023/chen2023ecmlpkdd-salas/}
}