Segment Anything
Abstract
We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation. Using our efficient model in a data collection loop, we built the largest segmentation dataset to date (by far), with over 1 billion masks on 11M licensed and privacy respecting images. The model is designed and trained to be promptable, so it can transfer zero-shot to new image distributions and tasks. We evaluate its capabilities on numerous tasks and find that its zero-shot performance is impressive -- often competitive with or even superior to prior fully supervised results. We are releasing the Segment Anything Model (SAM) and corresponding dataset (SA-1B) of 1B masks and 11M images at https://segment-anything.com to foster research into foundation models for computer vision. We recommend reading the full paper at: https://arxiv.org/abs/2304.02643.
Cite
Text
Kirillov et al. "Segment Anything." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.00371Markdown
[Kirillov et al. "Segment Anything." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/kirillov2023iccv-segment/) doi:10.1109/ICCV51070.2023.00371BibTeX
@inproceedings{kirillov2023iccv-segment,
title = {{Segment Anything}},
author = {Kirillov, Alexander and Mintun, Eric and Ravi, Nikhila and Mao, Hanzi and Rolland, Chloe and Gustafson, Laura and Xiao, Tete and Whitehead, Spencer and Berg, Alexander C. and Lo, Wan-Yen and Dollar, Piotr and Girshick, Ross},
booktitle = {International Conference on Computer Vision},
year = {2023},
pages = {4015-4026},
doi = {10.1109/ICCV51070.2023.00371},
url = {https://mlanthology.org/iccv/2023/kirillov2023iccv-segment/}
}