Decoupled Mixup for Out-of-Distribution Visual Recognition

Abstract

Convolutional neural networks (CNN) have demonstrated remarkable performance, when the training and testing data are from the same distribution. However, such trained CNN models often largely degrade on testing data which is unseen and Out-Of-the-Distribution (OOD). To address this issue, we propose a novel “Decoupled-Mixup" method to train CNN models for OOD visual recognition. Different from previous work combining pairs of images homogeneously, our method decouples each image into discriminative and noise-prone regions, and then heterogeneously combine these regions of image pairs to train CNN models. Since the observation is that noise-prone regions such as textural and clutter background are adverse to the generalization ability of CNN models during training, we enhance features from discriminative regions and suppress noise-prone ones when combining an image pair. To further improves the generalization ability of trained models, we propose to disentangle discriminative and noise-prone regions in frequency-based and context-based fashions. Experiment results show the high generalization performance of our method on testing data that are composed of unseen contexts, where our method achieves 85.76% top-1 accuracy in Track-1 and 79.92% in Track-2 in NICO Challenge. The source code is available at https://github.com/HaozheLiu-ST/NICOChallenge-OOD-Classification .

Cite

Text

Liu et al. "Decoupled Mixup for Out-of-Distribution Visual Recognition." European Conference on Computer Vision Workshops, 2022. doi:10.1007/978-3-031-25075-0_30

Markdown

[Liu et al. "Decoupled Mixup for Out-of-Distribution Visual Recognition." European Conference on Computer Vision Workshops, 2022.](https://mlanthology.org/eccvw/2022/liu2022eccvw-decoupled/) doi:10.1007/978-3-031-25075-0_30

BibTeX

@inproceedings{liu2022eccvw-decoupled,
  title     = {{Decoupled Mixup for Out-of-Distribution Visual Recognition}},
  author    = {Liu, Haozhe and Zhang, Wentian and Xie, Jinheng and Wu, Haoqian and Li, Bing and Zhang, Ziqi and Li, Yuexiang and Huang, Yawen and Ghanem, Bernard and Zheng, Yefeng},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2022},
  pages     = {451-464},
  doi       = {10.1007/978-3-031-25075-0_30},
  url       = {https://mlanthology.org/eccvw/2022/liu2022eccvw-decoupled/}
}