Dissecting Generalized Category Discovery: Multiplex Consensus Under Self-Deconstruction

Abstract

Human perceptual systems excel at inducing and recognizing objects across both known and novel categories, a capability far beyond current machine learning frameworks. While generalized category discovery (GCD) aims to bridge this gap, existing methods predominantly focus on optimizing objective functions. We present an orthogonal solution, inspired by the human cognitive process for novel object understanding: decomposing objects into visual primitives and establishing cross-knowledge comparisons. We propose ConGCD, which establishes primitive-oriented representations through high-level semantic reconstruction, binding intra-class shared attributes via deconstruction. Mirroring human preference diversity in visual processing, where distinct individuals leverage dominant or contextual cues, we implement dominant and contextual consensus units to capture class-discriminative patterns and inherent distributional invariants, respectively. A consensus scheduler dynamically optimizes activation pathways, with final predictions emerging through multiplex consensus integration. Extensive evaluations across coarse- and fine-grained benchmarks demonstrate ConGCD's effectiveness as a consensus-aware paradigm. Code is available at: https://github.com/lytang63/ConGCD.

Cite

Text

Tang et al. "Dissecting Generalized Category Discovery: Multiplex Consensus Under Self-Deconstruction." International Conference on Computer Vision, 2025.

Markdown

[Tang et al. "Dissecting Generalized Category Discovery: Multiplex Consensus Under Self-Deconstruction." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/tang2025iccv-dissecting/)

BibTeX

@inproceedings{tang2025iccv-dissecting,
  title     = {{Dissecting Generalized Category Discovery: Multiplex Consensus Under Self-Deconstruction}},
  author    = {Tang, Luyao and Huang, Kunze and Chen, Chaoqi and Yuan, Yuxuan and Li, Chenxin and Tu, Xiaotong and Ding, Xinghao and Huang, Yue},
  booktitle = {International Conference on Computer Vision},
  year      = {2025},
  pages     = {297-307},
  url       = {https://mlanthology.org/iccv/2025/tang2025iccv-dissecting/}
}