OuroMamba: A Data-Free Quantization Framework for Vision Mamba
Abstract
We present OuroMamba, the first data-free post-training quantization (DFQ) method for vision Mamba-based models (VMMs). We identify two key challenges in enabling DFQ for VMMs, (1) VMM's recurrent state transitions restricts the capturing of long-range interactions and leads to semantically weak synthetic data, (2) VMM activations exhibit dynamic outlier variations across time-steps, rendering existing static PTQ techniques ineffective. To address these challenges, OuroMamba presents a two-stage framework: (1) OuroMamba-Gen to generate semantically rich and meaningful synthetic data. It applies contrastive learning on patch level VMM features generated through neighborhood interactions in the latent state space, (2) OuroMamba-Quant to employ mixed-precision quantization with lightweight dynamic outlier detection during inference. In specific, we present a thresholding based outlier channel selection strategy for activations that gets updated every time-step. Extensive experiments across vision and generative tasks show that our data-free OuroMamba surpasses existing data-driven PTQ techniques, achieving state-of-the-art performance across diverse quantization settings. Additionally, we implement efficient GPU kernels to achieve practical latency speedup of up to 2.36x. Code and synthetic dataset are available here: https://github.com/georgia-tech-synergy-lab/ICCV-OuroMamba.
Cite
Text
Ramachandran et al. "OuroMamba: A Data-Free Quantization Framework for Vision Mamba." International Conference on Computer Vision, 2025.Markdown
[Ramachandran et al. "OuroMamba: A Data-Free Quantization Framework for Vision Mamba." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/ramachandran2025iccv-ouromamba/)BibTeX
@inproceedings{ramachandran2025iccv-ouromamba,
title = {{OuroMamba: A Data-Free Quantization Framework for Vision Mamba}},
author = {Ramachandran, Akshat and Lee, Mingyu and Xu, Huan and Kundu, Souvik and Krishna, Tushar},
booktitle = {International Conference on Computer Vision},
year = {2025},
pages = {21177-21186},
url = {https://mlanthology.org/iccv/2025/ramachandran2025iccv-ouromamba/}
}