Probing Synergistic High-Order Interaction in Infrared and Visible Image Fusion

Abstract

Infrared and visible image fusion aims to generate a fused image by integrating and distinguishing complementary information from multiple sources. While the cross-attention mechanism with global spatial interactions appears promising it only capture second-order spatial interactions neglecting higher-order interactions in both spatial and channel dimensions. This limitation hampers the exploitation of synergies between multi-modalities. To bridge this gap we introduce a Synergistic High-order Interaction Paradigm (SHIP) designed to systematically investigate spatial fine-grained and global statistics collaborations between infrared and visible images across two fundamental dimensions: 1) Spatial dimension: we construct spatial fine-grained interactions through element-wise multiplication mathematically equivalent to global interactions and then foster high-order formats by iteratively aggregating and evolving complementary information enhancing both efficiency and flexibility. 2) Channel dimension: expanding on channel interactions with first-order statistics (mean) we devise high-order channel interactions to facilitate the discernment of inter-dependencies between source images based on global statistics. Harnessing high-order interactions significantly enhances our model's ability to exploit multi-modal synergies leading in superior performance over state-of-the-art alternatives as shown through comprehensive experiments across various benchmarks.

Cite

Text

Zheng et al. "Probing Synergistic High-Order Interaction in Infrared and Visible Image Fusion." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.02492

Markdown

[Zheng et al. "Probing Synergistic High-Order Interaction in Infrared and Visible Image Fusion." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/zheng2024cvpr-probing/) doi:10.1109/CVPR52733.2024.02492

BibTeX

@inproceedings{zheng2024cvpr-probing,
  title     = {{Probing Synergistic High-Order Interaction in Infrared and Visible Image Fusion}},
  author    = {Zheng, Naishan and Zhou, Man and Huang, Jie and Hou, Junming and Li, Haoying and Xu, Yuan and Zhao, Feng},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {26384-26395},
  doi       = {10.1109/CVPR52733.2024.02492},
  url       = {https://mlanthology.org/cvpr/2024/zheng2024cvpr-probing/}
}