Post Hoc Regression Refinement via Pairwise Rankings

Abstract

Accurate prediction of continuous properties is essential to many scientific and engineering tasks. Although deep-learning regressors excel with abundant labels, their accuracy deteriorates in data-scarce regimes. We introduce RankRefine, a model-agnostic, plug-and-play post-hoc refinement technique that injects expert knowledge through pairwise rankings. Given a query item and a small reference set with known properties, RankRefine combines the base regressor’s output with a rank-based estimate via inverse-variance weighting, requiring no retraining. In molecular property prediction task, RankRefine achieves up to 10\% relative reduction in mean absolute error using only 20 pairwise comparisons obtained through a general-purpose large language model (LLM) with no finetuning. As rankings provided by human experts or general-purpose LLMs are sufficient for improving regression across diverse domains, RankRefine offers practicality and broad applicability, especially in low-data settings.

Cite

Text

Wijaya et al. "Post Hoc Regression Refinement via Pairwise Rankings." Advances in Neural Information Processing Systems, 2025.

Markdown

[Wijaya et al. "Post Hoc Regression Refinement via Pairwise Rankings." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/wijaya2025neurips-post/)

BibTeX

@inproceedings{wijaya2025neurips-post,
  title     = {{Post Hoc Regression Refinement via Pairwise Rankings}},
  author    = {Wijaya, Kevin Tirta and Sun, Michael and Guo, Minghao and Seidel, Hans-peter and Matusik, Wojciech and Babaei, Vahid},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/wijaya2025neurips-post/}
}