SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

Meng, Lingchen; Lan, Shiyi; Li, Hengduo; Alvarez, Jose M; Wu, Zuxuan; Jiang, Yu-Gang

doi:10.1007/978-3-031-72920-1_12

SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

Lingchen Meng, Shiyi Lan, Hengduo Li, Jose M Alvarez, Zuxuan Wu, Yu-Gang Jiang

ECCV 2024

doi:10.1007/978-3-031-72920-1_12 /eccv/2024/meng2024eccv-segic/

Abstract

In-context segmentation aims at segmenting novel images using a few labeled example images, termed as “in-context examples”, exploring content similarities between examples and the target. The resulting models can be generalized seamlessly to novel segmentation tasks, significantly reducing the labeling and training costs compared with conventional pipelines. However, in-context segmentation is more challenging than classic ones requiring the model to learn segmentation rules conditioned on a few samples. Unlike previous work with ad-hoc or non-end-to-end designs, we propose , an end-to-end segment-in-context framework built upon a single vision foundation model (VFM). In particular, leverages the emergent correspondence within VFM to capture dense relationships between target images and in-context samples. As such, information from in-context samples is then extracted into three types of instructions, geometric, visual, and meta instructions, serving as explicit conditions for the final mask prediction. is a straightforward yet effective approach that yields state-of-the-art performance on one-shot segmentation benchmarks. Notably, can be easily generalized to diverse tasks, including video object segmentation and open-vocabulary segmentation. Code will be available at https://github.com/MengLcool/SEGIC.

PDF ECCV Semantic Scholar

Cite

Text

Meng et al. "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72920-1_12

Markdown

[Meng et al. "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/meng2024eccv-segic/) doi:10.1007/978-3-031-72920-1_12

BibTeX

@inproceedings{meng2024eccv-segic,
  title     = {{SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation}},
  author    = {Meng, Lingchen and Lan, Shiyi and Li, Hengduo and Alvarez, Jose M and Wu, Zuxuan and Jiang, Yu-Gang},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-72920-1_12},
  url       = {https://mlanthology.org/eccv/2024/meng2024eccv-segic/}
}