InfoScan: Information-Efficient Visual Scanning via Resource-Adaptive Walks

Abstract

High-resolution visual representation learning remains challenging due to the quadratic complexity of Vision Transformers and the limitations of existing efficient approaches, where fixed scanning patterns in recent Mamba-based models hinder content-adaptive perception. To address these limitations, a novel Information-aware Scanning mechanism (InfoScan) tailored for state-space visual backbones is proposed, which dynamically allocates computational resources to the most salient regions of an image. Specifically, InfoScan rigorously assesses the informativeness of image patches by integrating entropy with local structural analyses, formulates a joint optimization objective balancing fine-grained detail preservation and broader contextual coherence, and learns an adaptive scanning policy via reinforcement learning. Built upon the innovative Visual Information State Space (VISS) block, InfoScan establishes a new family of models that achieve superior efficiency-accuracy trade-offs across diverse tasks. Extensive empirical evaluation in different downstream vision tasks demonstrates that our information-driven dynamic scanning paradigm offers a robust and principled alternative to fixed or global-first traversal methods. Collectively, our work positions adaptive, content-aware processing as a promising and effective new paradigm for efficient high-resolution visual representation.

Cite

Text

Wu et al. "InfoScan: Information-Efficient Visual Scanning via Resource-Adaptive Walks." International Conference on Learning Representations, 2026.

Markdown

[Wu et al. "InfoScan: Information-Efficient Visual Scanning via Resource-Adaptive Walks." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/wu2026iclr-infoscan/)

BibTeX

@inproceedings{wu2026iclr-infoscan,
  title     = {{InfoScan: Information-Efficient Visual Scanning via Resource-Adaptive Walks}},
  author    = {Wu, Yifeng and Huang, Huimin and Zhou, Shangjie and Huang, Yawen and Zheng, Hao and Chen, Yun and Wu, Xian and Han, Ruize and Chen, Guanhua},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/wu2026iclr-infoscan/}
}