Channel Augmented Joint Learning for Visible-Infrared Recognition

Abstract

This paper introduces a powerful channel augmented joint learning strategy for the visible-infrared recognition problem. For data augmentation, most existing methods directly adopt the standard operations designed for single-modality visible images, and thus do not fully consider the imagery properties in visible to infrared matching. Our basic idea is to homogenously generate color-irrelevant images by randomly exchanging the color channels. It can be seamlessly integrated into existing augmentation operations without modifying the network, consistently improving the robustness against color variations. Incorporated with a random erasing strategy, it further greatly enriches the diversity by simulating random occlusions. For cross-modality metric learning, we design an enhanced channel-mixed learning strategy to simultaneously handle the intra- and cross-modality variations with squared difference for stronger discriminability. Besides, a channel-augmented joint learning strategy is further developed to explicitly optimize the outputs of augmented images. Extensive experiments with insightful analysis on two visible-infrared recognition tasks show that the proposed strategies consistently improve the accuracy. Without auxiliary information, it improves the state-of-the-art Rank-1/mAP by 14.59%/13.00% on the large-scale SYSU-MM01 dataset.

Cite

Text

Ye et al. "Channel Augmented Joint Learning for Visible-Infrared Recognition." International Conference on Computer Vision, 2021. doi:10.1109/ICCV48922.2021.01331

Markdown

[Ye et al. "Channel Augmented Joint Learning for Visible-Infrared Recognition." International Conference on Computer Vision, 2021.](https://mlanthology.org/iccv/2021/ye2021iccv-channel/) doi:10.1109/ICCV48922.2021.01331

BibTeX

@inproceedings{ye2021iccv-channel,
  title     = {{Channel Augmented Joint Learning for Visible-Infrared Recognition}},
  author    = {Ye, Mang and Ruan, Weijian and Du, Bo and Shou, Mike Zheng},
  booktitle = {International Conference on Computer Vision},
  year      = {2021},
  pages     = {13567-13576},
  doi       = {10.1109/ICCV48922.2021.01331},
  url       = {https://mlanthology.org/iccv/2021/ye2021iccv-channel/}
}