Understanding Multi-Granularity for Open-Vocabulary Part Segmentation

Abstract

Open-vocabulary part segmentation (OVPS) is an emerging research area focused on segmenting fine-grained entities using diverse and previously unseen vocabularies.Our study highlights the inherent complexities of part segmentation due to intricate boundaries and diverse granularity, reflecting the knowledge-based nature of part identification.To address these challenges, we propose PartCLIPSeg, a novel framework utilizing generalized parts and object-level contexts to mitigate the lack of generalization in fine-grained parts.PartCLIPSeg integrates competitive part relationships and attention control, alleviating ambiguous boundaries and underrepresented parts.Experimental results demonstrate that PartCLIPSeg outperforms existing state-of-the-art OVPS methods, offering refined segmentation and an advanced understanding of part relationships within images.Through extensive experiments, our model demonstrated a significant improvement over the state-of-the-art models on the Pascal-Part-116, ADE20K-Part-234, and PartImageNet datasets.

Cite

Text

Choi et al. "Understanding Multi-Granularity for Open-Vocabulary Part Segmentation." Neural Information Processing Systems, 2024. doi:10.52202/079017-4358

Markdown

[Choi et al. "Understanding Multi-Granularity for Open-Vocabulary Part Segmentation." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/choi2024neurips-understanding/) doi:10.52202/079017-4358

BibTeX

@inproceedings{choi2024neurips-understanding,
  title     = {{Understanding Multi-Granularity for Open-Vocabulary Part Segmentation}},
  author    = {Choi, Jiho and Lee, Seonho and Lee, Seungho and Lee, Minhyun and Shim, Hyunjung},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-4358},
  url       = {https://mlanthology.org/neurips/2024/choi2024neurips-understanding/}
}