EAReranker: Efficient Embedding Adequacy Assessment for Retrieval Augmented Generation
Abstract
With the increasing adoption of Retrieval-Augmented Generation (RAG) systems for knowledge-intensive tasks, ensuring the adequacy of retrieved documents has become critically important for generation quality. Traditional reranking approaches face three significant challenges: substantial computational overhead that scales with document length, dependency on plain text that limits application in sensitive scenarios, and insufficient assessment of document value beyond simple relevance metrics. We propose EAReranker, an efficient embedding-based adequacy assessment framework that evaluates document utility for RAG systems without requiring access to original text content. The framework quantifies document adequacy through a comprehensive scoring methodology considering verifiability, coverage, completeness and structural aspects, providing interpretable adequacy classifications for downstream applications. EAReranker employs a Decoder-Only Transformer architecture that introduces embedding dimension expansion method and bin-aware weighted loss, designed specifically to predict adequacy directly from embedding vectors. Our comprehensive evaluation across four public benchmarks demonstrates that EAReranker achieves competitive performance with state-of-the-art plaintext rerankers while maintaining constant memory usage ($\sim$550MB) regardless of input length and processing 2-3x faster than traditional approaches. The semantic bin adequacy prediction accuracy of 92.85\% LACC@10 and 86.12\% LACC@25 demonstrates its capability to effectively filter out inadequate documents that could potentially mislead or adversely impact RAG system performance, thereby ensuring only high-utility information serves as generation context. These results establish EAReranker as an efficient and practical solution for enhancing RAG system performance through improved context selection while addressing the computational and privacy challenges of existing methods.
Cite
Text
Zeng et al. "EAReranker: Efficient Embedding Adequacy Assessment for Retrieval Augmented Generation." Advances in Neural Information Processing Systems, 2025.Markdown
[Zeng et al. "EAReranker: Efficient Embedding Adequacy Assessment for Retrieval Augmented Generation." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/zeng2025neurips-eareranker/)BibTeX
@inproceedings{zeng2025neurips-eareranker,
title = {{EAReranker: Efficient Embedding Adequacy Assessment for Retrieval Augmented Generation}},
author = {Zeng, Dongyang and Liu, Yaping and Zhang, Wei and Zhang, Shuo and Liu, Xinwang and Fang, Binxing},
booktitle = {Advances in Neural Information Processing Systems},
year = {2025},
url = {https://mlanthology.org/neurips/2025/zeng2025neurips-eareranker/}
}