Decoupled Mixup for Out-of-Distribution Visual Recognition
Abstract
Convolutional neural networks (CNN) have demonstrated remarkable performance, when the training and testing data are from the same distribution. However, such trained CNN models often largely degrade on testing data which is unseen and Out-Of-the-Distribution (OOD). To address this issue, we propose a novel “Decoupled-Mixup" method to train CNN models for OOD visual recognition. Different from previous work combining pairs of images homogeneously, our method decouples each image into discriminative and noise-prone regions, and then heterogeneously combine these regions of image pairs to train CNN models. Since the observation is that noise-prone regions such as textural and clutter background are adverse to the generalization ability of CNN models during training, we enhance features from discriminative regions and suppress noise-prone ones when combining an image pair. To further improves the generalization ability of trained models, we propose to disentangle discriminative and noise-prone regions in frequency-based and context-based fashions. Experiment results show the high generalization performance of our method on testing data that are composed of unseen contexts, where our method achieves 85.76% top-1 accuracy in Track-1 and 79.92% in Track-2 in NICO Challenge. The source code is available at https://github.com/HaozheLiu-ST/NICOChallenge-OOD-Classification .
Cite
Text
Liu et al. "Decoupled Mixup for Out-of-Distribution Visual Recognition." European Conference on Computer Vision Workshops, 2022. doi:10.1007/978-3-031-25075-0_30Markdown
[Liu et al. "Decoupled Mixup for Out-of-Distribution Visual Recognition." European Conference on Computer Vision Workshops, 2022.](https://mlanthology.org/eccvw/2022/liu2022eccvw-decoupled/) doi:10.1007/978-3-031-25075-0_30BibTeX
@inproceedings{liu2022eccvw-decoupled,
title = {{Decoupled Mixup for Out-of-Distribution Visual Recognition}},
author = {Liu, Haozhe and Zhang, Wentian and Xie, Jinheng and Wu, Haoqian and Li, Bing and Zhang, Ziqi and Li, Yuexiang and Huang, Yawen and Ghanem, Bernard and Zheng, Yefeng},
booktitle = {European Conference on Computer Vision Workshops},
year = {2022},
pages = {451-464},
doi = {10.1007/978-3-031-25075-0_30},
url = {https://mlanthology.org/eccvw/2022/liu2022eccvw-decoupled/}
}