A Simple Image Segmentation Framework via In-Context Examples
Abstract
Recently, there have been explorations of generalist segmentation models that can effectively tackle a variety of image segmentation tasks within a unified in-context learning framework. However, these methods still struggle with task ambiguity in in-context segmentation, as not all in-context examples can accurately convey the task information. In order to address this issue, we present SINE, a simple image $\textbf{S}$egmentation framework utilizing $\textbf{in}$-context $\textbf{e}$xamples. Our approach leverages a Transformer encoder-decoder structure, where the encoder provides high-quality image representations, and the decoder is designed to yield multiple task-specific output masks to eliminate task ambiguity effectively. Specifically, we introduce an In-context Interaction module to complement in-context information and produce correlations between the target image and the in-context example and a Matching Transformer that uses fixed matching and a Hungarian algorithm to eliminate differences between different tasks. In addition, we have further perfected the current evaluation system for in-context image segmentation, aiming to facilitate a holistic appraisal of these models. Experiments on various segmentation tasks show the effectiveness of the proposed method.
Cite
Text
Liu et al. "A Simple Image Segmentation Framework via In-Context Examples." Neural Information Processing Systems, 2024. doi:10.52202/079017-0791Markdown
[Liu et al. "A Simple Image Segmentation Framework via In-Context Examples." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/liu2024neurips-simple/) doi:10.52202/079017-0791BibTeX
@inproceedings{liu2024neurips-simple,
title = {{A Simple Image Segmentation Framework via In-Context Examples}},
author = {Liu, Yang and Jing, Chenchen and Li, Hengtao and Zhu, Muzhi and Chen, Hao and Wang, Xinlong and Shen, Chunhua},
booktitle = {Neural Information Processing Systems},
year = {2024},
doi = {10.52202/079017-0791},
url = {https://mlanthology.org/neurips/2024/liu2024neurips-simple/}
}