Post Hoc Regression Refinement via Pairwise Rankings
Abstract
Accurate prediction of continuous properties is essential to many scientific and engineering tasks. Although deep-learning regressors excel with abundant labels, their accuracy deteriorates in data-scarce regimes. We introduce RankRefine, a model-agnostic, plug-and-play post-hoc refinement technique that injects expert knowledge through pairwise rankings. Given a query item and a small reference set with known properties, RankRefine combines the base regressor’s output with a rank-based estimate via inverse-variance weighting, requiring no retraining. In molecular property prediction task, RankRefine achieves up to 10\% relative reduction in mean absolute error using only 20 pairwise comparisons obtained through a general-purpose large language model (LLM) with no finetuning. As rankings provided by human experts or general-purpose LLMs are sufficient for improving regression across diverse domains, RankRefine offers practicality and broad applicability, especially in low-data settings.
Cite
Text
Wijaya et al. "Post Hoc Regression Refinement via Pairwise Rankings." Advances in Neural Information Processing Systems, 2025.Markdown
[Wijaya et al. "Post Hoc Regression Refinement via Pairwise Rankings." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/wijaya2025neurips-post/)BibTeX
@inproceedings{wijaya2025neurips-post,
title = {{Post Hoc Regression Refinement via Pairwise Rankings}},
author = {Wijaya, Kevin Tirta and Sun, Michael and Guo, Minghao and Seidel, Hans-peter and Matusik, Wojciech and Babaei, Vahid},
booktitle = {Advances in Neural Information Processing Systems},
year = {2025},
url = {https://mlanthology.org/neurips/2025/wijaya2025neurips-post/}
}