MMCert: Provable Defense Against Adversarial Attacks to Multi-Modal Models

Yanting Wang, Hongye Fu, Wei Zou, Jinyuan Jia

CVPR 2024 pp. 24655-24664

doi:10.1109/CVPR52733.2024.02328 /cvpr/2024/wang2024cvpr-mmcert/

Abstract

Different from a unimodal model whose input is from a single modality the input (called multi-modal input) of a multi-modal model is from multiple modalities such as image 3D points audio text etc. Similar to unimodal models many existing studies show that a multi-modal model is also vulnerable to adversarial perturbation where an attacker could add small perturbation to all modalities of a multi-modal input such that the multi-modal model makes incorrect predictions for it. Existing certified defenses are mostly designed for unimodal models which achieve sub-optimal certified robustness guarantees when extended to multi-modal models as shown in our experimental results. In our work we propose MMCert the first certified defense against adversarial attacks to a multi-modal model. We derive a lower bound on the performance of our MMCert under arbitrary adversarial attacks with bounded perturbations to both modalities (e.g. in the context of auto-driving we bound the number of changed pixels in both RGB image and depth image). We evaluate our MMCert using two benchmark datasets: one for the multi-modal road segmentation task and the other for the multi-modal emotion recognition task. Moreover we compare our MMCert with a state-of-the-art certified defense extended from unimodal models. Our experimental results show that our MMCert outperforms the baseline.

PDF CVPR Semantic Scholar

Cite

Text

Wang et al. "MMCert: Provable Defense Against Adversarial Attacks to Multi-Modal Models." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.02328

Markdown

[Wang et al. "MMCert: Provable Defense Against Adversarial Attacks to Multi-Modal Models." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/wang2024cvpr-mmcert/) doi:10.1109/CVPR52733.2024.02328

BibTeX

@inproceedings{wang2024cvpr-mmcert,
  title     = {{MMCert: Provable Defense Against Adversarial Attacks to Multi-Modal Models}},
  author    = {Wang, Yanting and Fu, Hongye and Zou, Wei and Jia, Jinyuan},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {24655-24664},
  doi       = {10.1109/CVPR52733.2024.02328},
  url       = {https://mlanthology.org/cvpr/2024/wang2024cvpr-mmcert/}
}