Bridging Local Inductive Bias and Long-Range Dependencies with Pixel-Mamba for End-to-End Whole Slide Image Analysis

Abstract

Histopathology plays a critical role in medical diagnostics, with whole slide images (WSIs) offering valuable insights that directly influence clinical decision-making. However, the large size and complexity of WSIs may pose significant challenges for deep learning models, in both computational efficiency and effective representation learning. In this work, we introduce Pixel-Mamba, a novel deep learning architecture designed to efficiently handle gigapixel WSIs. Pixel-Mamba leverages the Mamba module, a state-space model (SSM) with linear memory complexity, and incorporates local inductive biases through progressively expanding tokens, akin to convolutional neural networks. This enables Pixel-Mamba to hierarchically combine both local and global information while efficiently addressing computational challenges. Remarkably, Pixel-Mamba achieves or even surpasses the quantitative performance of state-of-the-art (SOTA) foundation models that were pretrained on millions of WSIs or WSI-text pairs, in a range of tumor staging and survival analysis tasks, even without requiring any pathology-specific pretraining. Extensive experiments demonstrate the efficacy of Pixel-Mamba as a powerful and efficient framework for end-to-end WSI analysis.

Cite

Text

Qiu et al. "Bridging Local Inductive Bias and Long-Range Dependencies with Pixel-Mamba for End-to-End Whole Slide Image Analysis." International Conference on Computer Vision, 2025.

Markdown

[Qiu et al. "Bridging Local Inductive Bias and Long-Range Dependencies with Pixel-Mamba for End-to-End Whole Slide Image Analysis." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/qiu2025iccv-bridging/)

BibTeX

@inproceedings{qiu2025iccv-bridging,
  title     = {{Bridging Local Inductive Bias and Long-Range Dependencies with Pixel-Mamba for End-to-End Whole Slide Image Analysis}},
  author    = {Qiu, Zhongwei and Chao, Hanqing and Lin, Tiancheng and Chang, Wanxing and Yang, Zijiang and Jiao, Wenpei and Shen, Yixuan and Zhang, Yunshuo and Yang, Yelin and Liu, Wenbin and Jiang, Hui and Bian, Yun and Yan, Ke and Jin, Dakai and Lu, Le},
  booktitle = {International Conference on Computer Vision},
  year      = {2025},
  pages     = {22738-22747},
  url       = {https://mlanthology.org/iccv/2025/qiu2025iccv-bridging/}
}