Thinking like an Author: A Zero-Shot Learning Approach to Keyphrase Generation with Large Language Model
Abstract
Keyphrase generation aims to automatically derive a set of phrases from a given document. Recently, automated keyphrase generation has gained prominence as a focal point of research in both industry and academia, showcasing notable advancements in performance. This paper aims to employ a Large Language Model (LLM) for a more comprehensive extraction of keyphrases from documents. We observe that many article authors adhere to a four-step process when selecting keyphrases for a document. Initially, they extract keyphrases directly from the document. In addition, some authors will also choose hypernyms of the keyphrases selected in the previous stage as new extended keyphrases. Subsequently, these authors obtain keyphrases from documents with similar topics, opting for the most appropriate ones to include. Finally, the authors organize the keyphrases obtained from the extraction, extension, and retrieval steps, discerning the final keyphrases through a ranking process. Motivated by these observations, we introduce a zero-shot learning approach for keyphrase generation using the LLM, which includes four parts. The extractor, extender, and retriever components are responsible for recalling candidate keyphrases, while the ranker component is tasked with ranking them. Experimental results demonstrate the effectiveness of our approach on various datasets. Moreover, ablation experiments shed light on the impact of each module on the model’s final performance.
Cite
Text
Wang et al. "Thinking like an Author: A Zero-Shot Learning Approach to Keyphrase Generation with Large Language Model." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024. doi:10.1007/978-3-031-70352-2_20Markdown
[Wang et al. "Thinking like an Author: A Zero-Shot Learning Approach to Keyphrase Generation with Large Language Model." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024.](https://mlanthology.org/ecmlpkdd/2024/wang2024ecmlpkdd-thinking/) doi:10.1007/978-3-031-70352-2_20BibTeX
@inproceedings{wang2024ecmlpkdd-thinking,
title = {{Thinking like an Author: A Zero-Shot Learning Approach to Keyphrase Generation with Large Language Model}},
author = {Wang, Siyu and Dai, Shengran and Jiang, Jianhui},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2024},
pages = {335-350},
doi = {10.1007/978-3-031-70352-2_20},
url = {https://mlanthology.org/ecmlpkdd/2024/wang2024ecmlpkdd-thinking/}
}