Scale-Aware Automatic Augmentation for Object Detection
Abstract
We propose Scale-aware AutoAug to learn data augmentation policies for object detection. We define a new scale-aware search space, where both image- and box-level augmentations are designed for maintaining scale invariance. Upon this search space, we propose a new search metric, termed Pareto Scale Balance, to facilitate search with high efficiency. In experiments, Scale-aware AutoAug yields significant and consistent improvement on various object detectors (e.g., RetinaNet, Faster R-CNN, Mask R-CNN, and FCOS), even compared with strong multi-scale training baselines. Our searched augmentation policies are transferable to other datasets and box-level tasks beyond object detection (e.g., instance segmentation and keypoint estimation) to improve performance. The search cost is much less than previous automated augmentation approaches for object detection. It is notable that our searched policies have meaningful patterns, which intuitively provide valuable insight for human data augmentation design.
Cite
Text
Chen et al. "Scale-Aware Automatic Augmentation for Object Detection." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.00944Markdown
[Chen et al. "Scale-Aware Automatic Augmentation for Object Detection." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/chen2021cvpr-scaleaware/) doi:10.1109/CVPR46437.2021.00944BibTeX
@inproceedings{chen2021cvpr-scaleaware,
title = {{Scale-Aware Automatic Augmentation for Object Detection}},
author = {Chen, Yukang and Li, Yanwei and Kong, Tao and Qi, Lu and Chu, Ruihang and Li, Lei and Jia, Jiaya},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2021},
pages = {9563-9572},
doi = {10.1109/CVPR46437.2021.00944},
url = {https://mlanthology.org/cvpr/2021/chen2021cvpr-scaleaware/}
}