CLUSTSEG: Clustering for Universal Segmentation

Abstract

We present CLUSTSEG, a general, transformer-based framework that tackles different image segmentation tasks ($i.e.,$ superpixel, semantic, instance, and panoptic) through a unified, neural clustering scheme. Regarding queries as cluster centers, CLUSTSEG is innovative in two aspects: 1) cluster centers are initialized in heterogeneous ways so as to pointedly address task-specific demands ($e.g.,$ instance- or category-level distinctiveness), yet without modifying the architecture; and 2) pixel-cluster assignment, formalized in a cross-attention fashion, is alternated with cluster center update, yet without learning additional parameters. These innovations closely link CLUSTSEG to EM clustering and make it a transparent and powerful framework that yields superior results across the above segmentation tasks.

Cite

Text

Liang et al. "CLUSTSEG: Clustering for Universal Segmentation." International Conference on Machine Learning, 2023.

Markdown

[Liang et al. "CLUSTSEG: Clustering for Universal Segmentation." International Conference on Machine Learning, 2023.](https://mlanthology.org/icml/2023/liang2023icml-clustseg/)

BibTeX

@inproceedings{liang2023icml-clustseg,
  title     = {{CLUSTSEG: Clustering for Universal Segmentation}},
  author    = {Liang, James Chenhao and Zhou, Tianfei and Liu, Dongfang and Wang, Wenguan},
  booktitle = {International Conference on Machine Learning},
  year      = {2023},
  pages     = {20787-20809},
  volume    = {202},
  url       = {https://mlanthology.org/icml/2023/liang2023icml-clustseg/}
}