Cross-Modal Learning with Adversarial Samples

Abstract

With the rapid developments of deep neural networks, numerous deep cross-modal analysis methods have been presented and are being applied in widespread real-world applications, including healthcare and safety-critical environments. However, the recent studies on robustness and stability of deep neural networks show that a microscopic modification, known as adversarial sample, which is even imperceptible to humans, can easily fool a well-performed deep neural network and brings a new obstacle to deep cross-modal correlation exploring. In this paper, we propose a novel Cross-Modal correlation Learning with Adversarial samples, namely CMLA, which for the first time presents the existence of adversarial samples in cross-modal data. Moreover, we provide a simple yet effective adversarial sample learning method, where inter- and intra- modality similarity regularizations across different modalities are simultaneously integrated into the learning of adversarial samples. Finally, our proposed CMLA is demonstrated to be highly effective in cross-modal hashing based retrieval. Extensive experiments on two cross-modal benchmark datasets show that the adversarial examples produced by our CMLA are efficient in fooling a target deep cross-modal hashing network. On the other hand, such adversarial examples can significantly strengthen the robustness of the target network by conducting an adversarial training.

Cite

Text

Li et al. "Cross-Modal Learning with Adversarial Samples." Neural Information Processing Systems, 2019.

Markdown

[Li et al. "Cross-Modal Learning with Adversarial Samples." Neural Information Processing Systems, 2019.](https://mlanthology.org/neurips/2019/li2019neurips-crossmodal/)

BibTeX

@inproceedings{li2019neurips-crossmodal,
  title     = {{Cross-Modal Learning with Adversarial Samples}},
  author    = {Li, Chao and Gao, Shangqian and Deng, Cheng and Xie, De and Liu, Wei},
  booktitle = {Neural Information Processing Systems},
  year      = {2019},
  pages     = {10792-10802},
  url       = {https://mlanthology.org/neurips/2019/li2019neurips-crossmodal/}
}