Vision HGNN: An Image Is More than a Graph of Nodes

Abstract

The realm of graph-based modeling has proven its adaptability across diverse real-world data types. However, its applicability to general computer vision tasks had been limited until the introduction of the Vision Graph Neural Network (ViG). ViG divides input images into patches, conceptualized as nodes, constructing a graph through connections to nearest neighbors. Nonetheless, this method of graph construction confines itself to simple pairwise relationships, leading to surplus edges and unwarranted memory and computation expenses. In this paper, we enhance ViG by transcending conventional "pairwise" linkages and harnessing the power of the hypergraph to encapsulate image information. Our objective is to encompass more intricate inter-patch associations. In both training and inference phases, we adeptly establish and update the hypergraph structure using the Fuzzy C-Means method, ensuring minimal computational burden. This augmentation yields the Vision HyperGraph Neural Network (ViHGNN). The model's efficacy is empirically substantiated through its state-of-the-art performance on both image classification and object detection tasks, courtesy of the hypergraph structure learning module that uncovers higher-order relationships. Our code is available at: https://github.com/VITA-Group/ViHGNN.

Cite

Text

Han et al. "Vision HGNN: An Image Is More than a Graph of Nodes." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.01820

Markdown

[Han et al. "Vision HGNN: An Image Is More than a Graph of Nodes." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/han2023iccv-vision/) doi:10.1109/ICCV51070.2023.01820

BibTeX

@inproceedings{han2023iccv-vision,
  title     = {{Vision HGNN: An Image Is More than a Graph of Nodes}},
  author    = {Han, Yan and Wang, Peihao and Kundu, Souvik and Ding, Ying and Wang, Zhangyang},
  booktitle = {International Conference on Computer Vision},
  year      = {2023},
  pages     = {19878-19888},
  doi       = {10.1109/ICCV51070.2023.01820},
  url       = {https://mlanthology.org/iccv/2023/han2023iccv-vision/}
}