Towards Multidocument Summarization by Reformulation: Progress and Prospects
Abstract
By synthesizing information common to retrieved documents, multi-document summarization can help users of information retrieval systems to find relevant documents with a minimal amount of reading. We are developing a multidocument summarization system to automatically generate a concise summary by identifying and synthesizing similarities across a set of related documents. Our approach is unique in its integration of machine learning and statistical techniques to identify similar paragraphs, intersection of similar phrases within paragraphs, and language generation to reformulate the wording of the summary. Our evaluation of system components shows that learning over multiple extracted linguistic features is more effective than information retrieval approaches at identifying similar text units for summarization and that it is possible to generate a fluent summary that conveys similarities among documents even when full semantic interpretations of the input text are not available.
Cite
Text
McKeown et al. "Towards Multidocument Summarization by Reformulation: Progress and Prospects." AAAI Conference on Artificial Intelligence, 1999. doi:10.7916/d8sb4f3vMarkdown
[McKeown et al. "Towards Multidocument Summarization by Reformulation: Progress and Prospects." AAAI Conference on Artificial Intelligence, 1999.](https://mlanthology.org/aaai/1999/mckeown1999aaai-multidocument/) doi:10.7916/d8sb4f3vBibTeX
@inproceedings{mckeown1999aaai-multidocument,
title = {{Towards Multidocument Summarization by Reformulation: Progress and Prospects}},
author = {McKeown, Kathleen R. and Klavans, Judith and Hatzivassiloglou, Vasileios and Barzilay, Regina and Eskin, Eleazar},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {1999},
pages = {453-460},
doi = {10.7916/d8sb4f3v},
url = {https://mlanthology.org/aaai/1999/mckeown1999aaai-multidocument/}
}