InfoSAM: Fine-Tuning the Segment Anything Model from an Information-Theoretic Perspective

Abstract

The Segment Anything Model (SAM), a vision foundation model, exhibits impressive zero-shot capabilities in general tasks but struggles in specialized domains. Parameter-efficient fine-tuning (PEFT) is a promising approach to unleash the potential of SAM in novel scenarios. However, existing PEFT methods for SAM neglect the domain-invariant relations encoded in the pre-trained model. To bridge this gap, we propose InfoSAM, an information-theoretic approach that enhances SAM fine-tuning by distilling and preserving its pre-trained segmentation knowledge. Specifically, we formulate the knowledge transfer process as two novel mutual information-based objectives: (i) to compress the domain-invariant relation extracted from pre-trained SAM, excluding pseudo-invariant information as possible, and (ii) to maximize mutual information between the relational knowledge learned by the teacher (pre-trained SAM) and the student (fine-tuned model). The proposed InfoSAM establishes a robust distillation framework for PEFT of SAM. Extensive experiments across diverse benchmarks validate InfoSAM’s effectiveness in improving SAM family’s performance on real-world tasks, demonstrating its adaptability and superiority in handling specialized scenarios. The code and models are available at https://muyaoyuan.github.io/InfoSAM_Page.

Cite

Text

Zhang et al. "InfoSAM: Fine-Tuning the Segment Anything Model from an Information-Theoretic Perspective." Proceedings of the 42nd International Conference on Machine Learning, 2025.

Markdown

[Zhang et al. "InfoSAM: Fine-Tuning the Segment Anything Model from an Information-Theoretic Perspective." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/zhang2025icml-infosam/)

BibTeX

@inproceedings{zhang2025icml-infosam,
  title     = {{InfoSAM: Fine-Tuning the Segment Anything Model from an Information-Theoretic Perspective}},
  author    = {Zhang, Yuanhong and Yuan, Muyao and Zhang, Weizhan and Gong, Tieliang and Wen, Wen and Ying, Jiangyong and Shi, Weijie},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
  year      = {2025},
  pages     = {76655-76677},
  volume    = {267},
  url       = {https://mlanthology.org/icml/2025/zhang2025icml-infosam/}
}