Why Keep Your Doubts to Yourself? Trading Visual Uncertainties Among Vision-Language Models

Zhang, Jusheng; Fan, Yijia; Cai, Kaitong; Yang, Jing; Yao, Jiawei; Wang, Jian; Qu, Guanlong; Chen, Ziliang; Wang, Keze

Why Keep Your Doubts to Yourself? Trading Visual Uncertainties Among Vision-Language Models

Jusheng Zhang, Yijia Fan, Kaitong Cai, Jing Yang, Jiawei Yao, Jian Wang, Guanlong Qu, Ziliang Chen, Keze Wang

ICLR 2026

/iclr/2026/zhang2026iclr-keep/

Abstract

Vision-Language Models (VLMs) enable powerful multi-agent systems, but scaling them is economically unsustainable: coordinating heterogeneous agents under information asymmetry often spirals costs. Existing paradigms, such as Mixture-of-Agents and knowledge-based routers, rely on heuristic proxies that ignore costs and collapse uncertainty structure, leading to provably suboptimal coordination. We introduce Agora, a framework that reframes coordination as a decentralized market for uncertainty. Agora formalizes epistemic uncertainty into a structured, tradable asset (perceptual, semantic, inferential), and enforces profitability-driven trading among agents based on rational economic rules. A market-aware broker, extending Thompson Sampling, initiates collaboration and guides the system toward cost-efficient equilibria. Experiments on five multimodal benchmarks (MMMU, MMBench, MathVision, InfoVQA, CC-OCR) show that Agora outperforms strong VLMs and heuristic multi-agent strategies, e.g., achieving +8.5% accuracy over the best baseline on MMMU while reducing cost by over 3×. These results establish market-based coordination as a principled and scalable paradigm for building economically viable multi-agent visual intelligence systems.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Zhang et al. "Why Keep Your Doubts to Yourself? Trading Visual Uncertainties Among Vision-Language Models." International Conference on Learning Representations, 2026.

Markdown

[Zhang et al. "Why Keep Your Doubts to Yourself? Trading Visual Uncertainties Among Vision-Language Models." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/zhang2026iclr-keep/)

BibTeX

@inproceedings{zhang2026iclr-keep,
  title     = {{Why Keep Your Doubts to Yourself? Trading Visual Uncertainties Among Vision-Language Models}},
  author    = {Zhang, Jusheng and Fan, Yijia and Cai, Kaitong and Yang, Jing and Yao, Jiawei and Wang, Jian and Qu, Guanlong and Chen, Ziliang and Wang, Keze},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/zhang2026iclr-keep/}
}