Scale-Aware Automatic Augmentation for Object Detection

Abstract

We propose Scale-aware AutoAug to learn data augmentation policies for object detection. We define a new scale-aware search space, where both image- and box-level augmentations are designed for maintaining scale invariance. Upon this search space, we propose a new search metric, termed Pareto Scale Balance, to facilitate search with high efficiency. In experiments, Scale-aware AutoAug yields significant and consistent improvement on various object detectors (e.g., RetinaNet, Faster R-CNN, Mask R-CNN, and FCOS), even compared with strong multi-scale training baselines. Our searched augmentation policies are transferable to other datasets and box-level tasks beyond object detection (e.g., instance segmentation and keypoint estimation) to improve performance. The search cost is much less than previous automated augmentation approaches for object detection. It is notable that our searched policies have meaningful patterns, which intuitively provide valuable insight for human data augmentation design.

Cite

Text

Chen et al. "Scale-Aware Automatic Augmentation for Object Detection." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.00944

Markdown

[Chen et al. "Scale-Aware Automatic Augmentation for Object Detection." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/chen2021cvpr-scaleaware/) doi:10.1109/CVPR46437.2021.00944

BibTeX

@inproceedings{chen2021cvpr-scaleaware,
  title     = {{Scale-Aware Automatic Augmentation for Object Detection}},
  author    = {Chen, Yukang and Li, Yanwei and Kong, Tao and Qi, Lu and Chu, Ruihang and Li, Lei and Jia, Jiaya},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2021},
  pages     = {9563-9572},
  doi       = {10.1109/CVPR46437.2021.00944},
  url       = {https://mlanthology.org/cvpr/2021/chen2021cvpr-scaleaware/}
}