KnowProxy: Adapting Large Language Models by Knowledge-Guided Proxy

Abstract

Adapting large language models (LLMs) using smaller proxy models has been shown to improve training efficiency, where the LLMs remain frozen while the proxies are tuned on top. However, this approach typically requires access to the output probability distributions of LLMs, which are often inaccessible or unstable. To address this limitation, we propose KnowProxy, a knowledge-guided proxy framework in which the proxy is trained with textual knowledge rather than probability distributions. Specifically, we first elicit textual knowledge and reasoning from frozen LLMs through prompting, and then the proxy model learns to adapt this reasoning to target task distributions. We evaluate KnowProxy on diverse reasoning benchmarks with different fine-tuning scenarios. Comprehensive results show that KnowProxy achieves competitive or even better performance without direct access to probability distributions, thereby providing a scalable and versatile alternative to traditional fine-tuning.

Cite

Text

Lee et al. "KnowProxy: Adapting Large Language Models by Knowledge-Guided Proxy." International Conference on Learning Representations, 2026.

Markdown

[Lee et al. "KnowProxy: Adapting Large Language Models by Knowledge-Guided Proxy." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/lee2026iclr-knowproxy/)

BibTeX

@inproceedings{lee2026iclr-knowproxy,
  title     = {{KnowProxy: Adapting Large Language Models by Knowledge-Guided Proxy}},
  author    = {Lee, Gukhyeon and Kim, Yeachan and Lee, SangKeun},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/lee2026iclr-knowproxy/}
}