SSML-QNet: Scale-Separative Metric Learning Quadruplet Network for Multi-Modal Image Patch Matching

Abstract

Multi-modal image matching is very challenging due to the significant diversities in visual appearance of different modal images. Typically, the existing well-performed methods mainly focus on learning invariant and discriminative features for measuring the relation between multi-modal image pairs. However, these methods often take the features as a whole and largely overlook the fact that different scale features for a same image pair may have different similarity, which may lead to sub-optimal results only. In this work, we propose a Scale-Separative Metric Learning Quadruplet network (SSML-QNet) for multi-modal image patch matching. Specifically, SSML-QNet can extract both relevant and irrelevant features of imaging modality with the proposed quadruplet network architecture. Then, the proposed Scale-Separative Metric Learning module separately encodes the similarity of different scale features with the pyramid structure. And for each scale, cross-modal consistent features are extracted and measured by coordinate and channel-wise attention sequentially. This makes our network robust to appearance divergence caused by different imaging mechanism. Experiments on the benchmark dataset (VIS-NIR, VIS-LWIR, Optical-SAR, and Brown) have verified that the proposed SSML-QNet is able to outperform other state-of-the-art methods. Furthermore, the cross-dataset transferring experiments on these four datasets also have shown that the proposed method has powerful ability of cross-dataset transferring.

Cite

Text

Zhang et al. "SSML-QNet: Scale-Separative Metric Learning Quadruplet Network for Multi-Modal Image Patch Matching." International Joint Conference on Artificial Intelligence, 2023. doi:10.24963/IJCAI.2023/511

Markdown

[Zhang et al. "SSML-QNet: Scale-Separative Metric Learning Quadruplet Network for Multi-Modal Image Patch Matching." International Joint Conference on Artificial Intelligence, 2023.](https://mlanthology.org/ijcai/2023/zhang2023ijcai-ssml/) doi:10.24963/IJCAI.2023/511

BibTeX

@inproceedings{zhang2023ijcai-ssml,
  title     = {{SSML-QNet: Scale-Separative Metric Learning Quadruplet Network for Multi-Modal Image Patch Matching}},
  author    = {Zhang, Xiuwei and Sun, Yi and Han, Yamin and Li, Yanping and Yin, Hanlin and Xing, Yinghui and Zhang, Yanning},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {4593-4601},
  doi       = {10.24963/IJCAI.2023/511},
  url       = {https://mlanthology.org/ijcai/2023/zhang2023ijcai-ssml/}
}