DivAug: Plug-in Automated Data Augmentation with Explicit Diversity Maximization

Abstract

Human-designed data augmentation strategies havebeen replaced by automatically learned augmentation pol-icy in the past two years. Specifically, recent works haveexperimentally shown that the superior performance of theautomated methods stems from increasing the diversity ofaugmented data. However, two factors regard-ing the diversity of augmented data are still missing: 1)the explicit definition (and thus measurement) of diversityand 2) the quantifiable relationship between diversity andits regularization effects. To fill this gap, we propose a di-versity measure called "Variance Diversity" and theoreti-cally show that the regularization effect of data augmenta-tion is promised by Variance Diversity. We confirm in exper-iments that the relative gain from automated data augmen-tation in test accuracy of a given model is highly correlatedto Variance Diversity. To improve the search process ofautomated augmentation, an unsupervised sampling-basedframework,DivAug, is designed to directly optimize Vari-ance Diversity and hence strengthen the regularization ef-fect. Without requiring a separate search process, the per-formance gain from DivAug is comparable with state-of-the-art method with better efficiency. Moreover, under thesemi-supervised setting, our framework can further improvethe performance of semi-supervised learning algorithmsbased on RandAugment, making it highly applicable to real-world problems, where labeled data is scarce. The code is available at https://github.com/warai-0toko/DivAug.

Cite

Text

Liu et al. "DivAug: Plug-in Automated Data Augmentation with Explicit Diversity Maximization." International Conference on Computer Vision, 2021. doi:10.1109/ICCV48922.2021.00472

Markdown

[Liu et al. "DivAug: Plug-in Automated Data Augmentation with Explicit Diversity Maximization." International Conference on Computer Vision, 2021.](https://mlanthology.org/iccv/2021/liu2021iccv-divaug/) doi:10.1109/ICCV48922.2021.00472

BibTeX

@inproceedings{liu2021iccv-divaug,
  title     = {{DivAug: Plug-in Automated Data Augmentation with Explicit Diversity Maximization}},
  author    = {Liu, Zirui and Jin, Haifeng and Wang, Ting-Hsiang and Zhou, Kaixiong and Hu, Xia},
  booktitle = {International Conference on Computer Vision},
  year      = {2021},
  pages     = {4762-4770},
  doi       = {10.1109/ICCV48922.2021.00472},
  url       = {https://mlanthology.org/iccv/2021/liu2021iccv-divaug/}
}