Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-Identification
Abstract
RGB-infrared person re-identification is an emerging cross-modality re-identification task, which is very challenging due to significant modality discrepancy between RGB and infrared images. In this work, we propose a novel modality-adaptive mixup and invariant decomposition (MID) approach for RGB-infrared person re-identification towards learning modality-invariant and discriminative representations. MID designs a modality-adaptive mixup scheme to generate suitable mixed modality images between RGB and infrared images for mitigating the inherent modality discrepancy at the pixel-level. It formulates modality mixup procedure as Markov decision process, where an actor-critic agent learns dynamical and local linear interpolation policy between different regions of cross-modality images under a deep reinforcement learning framework. Such policy guarantees modality-invariance in a more continuous latent space and avoids manifold intrusion by the corrupted mixed modality samples. Moreover, to further counter modality discrepancy and enforce invariant visual semantics at the feature-level, MID employs modality-adaptive convolution decomposition to disassemble a regular convolution layer into modality-specific basis layers and a modality-shared coefficient layer. Extensive experimental results on two challenging benchmarks demonstrate superior performance of MID over state-of-the-art methods.
Cite
Text
Huang et al. "Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-Identification." AAAI Conference on Artificial Intelligence, 2022. doi:10.1609/AAAI.V36I1.19987Markdown
[Huang et al. "Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-Identification." AAAI Conference on Artificial Intelligence, 2022.](https://mlanthology.org/aaai/2022/huang2022aaai-modality/) doi:10.1609/AAAI.V36I1.19987BibTeX
@inproceedings{huang2022aaai-modality,
title = {{Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-Identification}},
author = {Huang, Zhipeng and Liu, Jiawei and Li, Liang and Zheng, Kecheng and Zha, Zheng-Jun},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2022},
pages = {1034-1042},
doi = {10.1609/AAAI.V36I1.19987},
url = {https://mlanthology.org/aaai/2022/huang2022aaai-modality/}
}