Dense Label Encoding for Boundary Discontinuity Free Rotation Detection
Abstract
Rotation detection serves as a fundamental building block in many visual applications involving aerial image, scene text, and face etc. Differing from the dominant regression-based approaches for orientation estimation, this paper explores a relatively less-studied methodology based on classification. The hope is to inherently dismiss the boundary discontinuity issue as encountered by the regression-based detectors. We propose new techniques to push its frontier in two aspects: i) new encoding mechanism: the design of two Densely Coded Labels (DCL) for angle classification, to replace the Sparsely Coded Label (SCL) in existing classification-based detectors, leading to three times training speed increase as empirically observed across benchmarks, further with notable improvement in detection accuracy; ii) loss re-weighting: we propose Angle Distance and Aspect Ratio Sensitive Weighting (ADARSW), which improves the detection accuracy especially for square-like objects, by making DCL-based detectors sensitive to angular distance and object's aspect ratio. Extensive experiments and visual analysis on large-scale public datasets for aerial images i.e. DOTA, UCAS-AOD, HRSC2016, as well as scene text dataset ICDAR2015 and MLT, show the effectiveness of our approach. The source code will be made public available.
Cite
Text
Yang et al. "Dense Label Encoding for Boundary Discontinuity Free Rotation Detection." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.01556Markdown
[Yang et al. "Dense Label Encoding for Boundary Discontinuity Free Rotation Detection." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/yang2021cvpr-dense/) doi:10.1109/CVPR46437.2021.01556BibTeX
@inproceedings{yang2021cvpr-dense,
title = {{Dense Label Encoding for Boundary Discontinuity Free Rotation Detection}},
author = {Yang, Xue and Hou, Liping and Zhou, Yue and Wang, Wentao and Yan, Junchi},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2021},
pages = {15819-15829},
doi = {10.1109/CVPR46437.2021.01556},
url = {https://mlanthology.org/cvpr/2021/yang2021cvpr-dense/}
}