DocKS-RAG: Optimizing Document-Level Relation Extraction Through LLM-Enhanced Hybrid Prompt Tuning

Abstract

Document-level relation extraction (RE) aims to extract comprehensive correlations between entities and relations from documents. Most of existing works conduct transfer learning on pre-trained language models (PLMs), which allows for richer contextual representation to improve the performance. However, such PLMs-based methods suffer from incorporating structural knowledge, such as entity-entity interactions. Moreover, current works struggle to infer the implicit relations between entities across different sentences, which results in poor prediction. To deal with the above issues, we propose a novel and effective framework, named DocKS-RAG, which introduces extra structural knowledge and semantic information to further enhance the performance of document-level RE. Specifically, we construct a Document-level Knowledge Graph from the observable documentation data to better capture the structural information between entities and relations. Then, a Sentence-level Semantic Retrieval-Augmented Generation mechanism is designed to consider the similarity in different sentences by retrieving the relevant contextual semantic information. Furthermore, we present a hybrid-prompt tuning method on large language models (LLMs) for specific document-level RE tasks. Finally, extensive experiments conducted on two benchmark datasets demonstrate that our proposed framework enhances all the metrics compared with state-of-the-art methods.

Cite

Text

Xu et al. "DocKS-RAG: Optimizing Document-Level Relation Extraction Through LLM-Enhanced Hybrid Prompt Tuning." Proceedings of the 42nd International Conference on Machine Learning, 2025.

Markdown

[Xu et al. "DocKS-RAG: Optimizing Document-Level Relation Extraction Through LLM-Enhanced Hybrid Prompt Tuning." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/xu2025icml-docksrag/)

BibTeX

@inproceedings{xu2025icml-docksrag,
  title     = {{DocKS-RAG: Optimizing Document-Level Relation Extraction Through LLM-Enhanced Hybrid Prompt Tuning}},
  author    = {Xu, Xiaolong and Zhou, Yibo and Xiang, Haolong and Li, Xiaoyong and Zhang, Xuyun and Qi, Lianyong and Dou, Wanchun},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
  year      = {2025},
  pages     = {69936-69949},
  volume    = {267},
  url       = {https://mlanthology.org/icml/2025/xu2025icml-docksrag/}
}