SAM 3: Segment Anything with Concepts

Carion, Nicolas; Gustafson, Laura; Hu, Yuan-Ting; Debnath, Shoubhik; Hu, Ronghang; Coll-Vinent, Didac Suris; Ryali, Chaitanya; Alwala, Kalyan Vasudev; Khedr, Haitham; Huang, Andrew; Lei, Jie; Ma, Tengyu; Guo, Baishan; Kalla, Arpit; Marks, Markus; Greer, Joseph; Wang, Meng; Sun, Peize; Rädle, Roman; Afouras, Triantafyllos; Mavroudi, Effrosyni; Xu, Katherine; Wu, Tsung-Han; Zhou, Yu; Momeni, Liliane; Hazra, Rishi; Ding, Shuangrui; Vaze, Sagar; Porcher, Francois; Li, Feng; Li, Siyuan; Kamath, Aishwarya; Cheng, Ho Kei; Dollar, Piotr; Ravi, Nikhila; Saenko, Kate; Zhang, Pengchuan; Feichtenhofer, Christoph

SAM 3: Segment Anything with Concepts

ICLR 2026

/iclr/2026/carion2026iclr-sam/

Abstract

We present Segment Anything Model (SAM) 3, a unified model that detects, segments, and tracks objects in images and videos based on concept prompts, which we define as either short noun phrases (e.g., “yellow school bus”), image exemplars, or a combination of both. Promptable Concept Segmentation (PCS) takes such prompts and returns segmentation masks and unique identities for all matching object instances. To advance PCS, we build a scalable data engine that produces a high-quality dataset with 4M unique concept labels, including hard negatives, across images and videos. Our model consists of an image-level detector and a memory-based video tracker that share a single backbone. Recognition and localization are decoupled with a presence head, which boosts detection accuracy. SAM 3 doubles the accuracy of existing systems in both image and video PCS, and improves previous SAM capabilities on visual segmentation tasks. We open source SAM 3 along with our new Segment Anything with Concepts (SA-Co) benchmark for promptable concept segmentation.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Carion et al. "SAM 3: Segment Anything with Concepts." International Conference on Learning Representations, 2026.

Markdown

[Carion et al. "SAM 3: Segment Anything with Concepts." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/carion2026iclr-sam/)

BibTeX

@inproceedings{carion2026iclr-sam,
  title     = {{SAM 3: Segment Anything with Concepts}},
  author    = {Carion, Nicolas and Gustafson, Laura and Hu, Yuan-Ting and Debnath, Shoubhik and Hu, Ronghang and Coll-Vinent, Didac Suris and Ryali, Chaitanya and Alwala, Kalyan Vasudev and Khedr, Haitham and Huang, Andrew and Lei, Jie and Ma, Tengyu and Guo, Baishan and Kalla, Arpit and Marks, Markus and Greer, Joseph and Wang, Meng and Sun, Peize and Rädle, Roman and Afouras, Triantafyllos and Mavroudi, Effrosyni and Xu, Katherine and Wu, Tsung-Han and Zhou, Yu and Momeni, Liliane and Hazra, Rishi and Ding, Shuangrui and Vaze, Sagar and Porcher, Francois and Li, Feng and Li, Siyuan and Kamath, Aishwarya and Cheng, Ho Kei and Dollar, Piotr and Ravi, Nikhila and Saenko, Kate and Zhang, Pengchuan and Feichtenhofer, Christoph},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/carion2026iclr-sam/}
}