DivAug: Plug-in Automated Data Augmentation with Explicit Diversity Maximization
Abstract
Human-designed data augmentation strategies havebeen replaced by automatically learned augmentation pol-icy in the past two years. Specifically, recent works haveexperimentally shown that the superior performance of theautomated methods stems from increasing the diversity ofaugmented data. However, two factors regard-ing the diversity of augmented data are still missing: 1)the explicit definition (and thus measurement) of diversityand 2) the quantifiable relationship between diversity andits regularization effects. To fill this gap, we propose a di-versity measure called "Variance Diversity" and theoreti-cally show that the regularization effect of data augmenta-tion is promised by Variance Diversity. We confirm in exper-iments that the relative gain from automated data augmen-tation in test accuracy of a given model is highly correlatedto Variance Diversity. To improve the search process ofautomated augmentation, an unsupervised sampling-basedframework,DivAug, is designed to directly optimize Vari-ance Diversity and hence strengthen the regularization ef-fect. Without requiring a separate search process, the per-formance gain from DivAug is comparable with state-of-the-art method with better efficiency. Moreover, under thesemi-supervised setting, our framework can further improvethe performance of semi-supervised learning algorithmsbased on RandAugment, making it highly applicable to real-world problems, where labeled data is scarce. The code is available at https://github.com/warai-0toko/DivAug.
Cite
Text
Liu et al. "DivAug: Plug-in Automated Data Augmentation with Explicit Diversity Maximization." International Conference on Computer Vision, 2021. doi:10.1109/ICCV48922.2021.00472Markdown
[Liu et al. "DivAug: Plug-in Automated Data Augmentation with Explicit Diversity Maximization." International Conference on Computer Vision, 2021.](https://mlanthology.org/iccv/2021/liu2021iccv-divaug/) doi:10.1109/ICCV48922.2021.00472BibTeX
@inproceedings{liu2021iccv-divaug,
title = {{DivAug: Plug-in Automated Data Augmentation with Explicit Diversity Maximization}},
author = {Liu, Zirui and Jin, Haifeng and Wang, Ting-Hsiang and Zhou, Kaixiong and Hu, Xia},
booktitle = {International Conference on Computer Vision},
year = {2021},
pages = {4762-4770},
doi = {10.1109/ICCV48922.2021.00472},
url = {https://mlanthology.org/iccv/2021/liu2021iccv-divaug/}
}