AdaReasoner: Adaptive Reasoning Enables More Flexible Thinking

Abstract

LLMs often need effective configurations, like temperature and reasoning steps, to handle tasks requiring sophisticated reasoning and problem-solving, ranging from joke generation to mathematical reasoning. Existing prompting approaches usually adopt general-purpose, fixed configurations that work “well enough” across tasks but seldom achieve task-specific optimality. To address this gap, we introduce AdaReasoner, an LLM-agnostic plugin designed for any LLM to automate adaptive reasoning configurations for tasks requiring different types of thinking. AdaReasoner is trained using a reinforcement learning (RL) framework, combining a factorized action space with a targeted exploration strategy, along with a pretrained reward model to optimize the policy model for reasoning configurations with only a few-shot guide. AdaReasoner is backed by theoretical guarantees and experiments of fast convergence and a sublinear policy gap. Across six different LLMs and a variety of reasoning tasks, it consistently outperforms standard baselines, preserves out-of-distribution robustness, and yield gains on knowledge-intensive tasks through tailored prompts.

Cite

Text

Wang et al. "AdaReasoner: Adaptive Reasoning Enables More Flexible Thinking." Advances in Neural Information Processing Systems, 2025.

Markdown

[Wang et al. "AdaReasoner: Adaptive Reasoning Enables More Flexible Thinking." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/wang2025neurips-adareasoner/)

BibTeX

@inproceedings{wang2025neurips-adareasoner,
  title     = {{AdaReasoner: Adaptive Reasoning Enables More Flexible Thinking}},
  author    = {Wang, Xiangqi and Huang, Yue and Wang, Yanbo and Luo, Xiaonan and Guo, Kehan and Zhou, Yujun and Zhang, Xiangliang},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/wang2025neurips-adareasoner/}
}