Multiscale Graph Texture Network

Abstract

Texture recognition has predominantly relied on methods based on handcrafted features and more recently, on Convolutional Neural Network (CNN)-based methods. However, many of these approaches do not capture the underlying directional relationships between visual vocabularies, attributes and features. In this study, we introduce a graph-based deep learning framework for texture and material recognition called Graph Texture Network (GTN) that models the underlying directional associations among latent texture attributes, that are hierarchically related to visual texture attributes, facilitating information exchange among them and consequently improving the discriminative capability among different texture and material categories. GTN, designed to handle non-Euclidean data structures, provides flexibility to learn complex underlying relationships among latent texture attributes via a learnable masked adjacency matrix. To ensure robustness of GTN to noise, especially on graphs with fewer vertices, we facilitate re-calibration of self-loop edge weights to preserve salient texture information within each vertex. We then utilize message passing mechanisms to enrich the representations of latent texture attributes. Furthermore, GTN is able to facilitate interaction across multiple graphs, representing texture information across a range of scales. Finally, GTN can be easily incorporated into a variety of CNN architectures for end-to-end training and does not require fine-tuning of pre-trained CNN backbones. Experimental results demonstrate that GTN achieves state-of-the-art performance on several benchmark texture and material datasets. Our code is available 1 . 1 Code: https://github.com/RavishankarEvani/GTN

Cite

Text

Evani et al. "Multiscale Graph Texture Network." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73411-3_13

Markdown

[Evani et al. "Multiscale Graph Texture Network." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/evani2024eccv-multiscale/) doi:10.1007/978-3-031-73411-3_13

BibTeX

@inproceedings{evani2024eccv-multiscale,
  title     = {{Multiscale Graph Texture Network}},
  author    = {Evani, Ravishankar and Rajan, Deepu and Mao, Shangbo},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-73411-3_13},
  url       = {https://mlanthology.org/eccv/2024/evani2024eccv-multiscale/}
}