Segment Anything

Abstract

We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation. Using our efficient model in a data collection loop, we built the largest segmentation dataset to date (by far), with over 1 billion masks on 11M licensed and privacy respecting images. The model is designed and trained to be promptable, so it can transfer zero-shot to new image distributions and tasks. We evaluate its capabilities on numerous tasks and find that its zero-shot performance is impressive -- often competitive with or even superior to prior fully supervised results. We are releasing the Segment Anything Model (SAM) and corresponding dataset (SA-1B) of 1B masks and 11M images at https://segment-anything.com to foster research into foundation models for computer vision. We recommend reading the full paper at: https://arxiv.org/abs/2304.02643.

Cite

Text

Kirillov et al. "Segment Anything." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.00371

Markdown

[Kirillov et al. "Segment Anything." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/kirillov2023iccv-segment/) doi:10.1109/ICCV51070.2023.00371

BibTeX

@inproceedings{kirillov2023iccv-segment,
  title     = {{Segment Anything}},
  author    = {Kirillov, Alexander and Mintun, Eric and Ravi, Nikhila and Mao, Hanzi and Rolland, Chloe and Gustafson, Laura and Xiao, Tete and Whitehead, Spencer and Berg, Alexander C. and Lo, Wan-Yen and Dollar, Piotr and Girshick, Ross},
  booktitle = {International Conference on Computer Vision},
  year      = {2023},
  pages     = {4015-4026},
  doi       = {10.1109/ICCV51070.2023.00371},
  url       = {https://mlanthology.org/iccv/2023/kirillov2023iccv-segment/}
}