Fast Interactive Object Annotation with Curve-GCN
Abstract
Manually labeling objects by tracing their boundaries is a laborious process. In Polygon-RNN++, the authors proposed Polygon-RNN that produces polygonal annotations in a recurrent manner using a CNN-RNN architecture, allowing interactive correction via humans-in-the-loop. We propose a new framework that alleviates the sequential nature of Polygon-RNN, by predicting all vertices simultaneously using a Graph Convolutional Network (GCN). Our model is trained end-to-end, and runs in real time. It supports object annotation by either polygons or splines, facilitating labeling efficiency for both line-based and curved objects. We show that Curve-GCN outperforms all existing approaches in automatic mode, including the powerful DeepLab, and is significantly more efficient in interactive mode than Polygon-RNN++. Our model runs at 29.3ms in automatic, and 2.6ms in interactive mode, making it 10x and 100x faster than Polygon-RNN++.
Cite
Text
Ling et al. "Fast Interactive Object Annotation with Curve-GCN." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. doi:10.1109/CVPR.2019.00540Markdown
[Ling et al. "Fast Interactive Object Annotation with Curve-GCN." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.](https://mlanthology.org/cvpr/2019/ling2019cvpr-fast/) doi:10.1109/CVPR.2019.00540BibTeX
@inproceedings{ling2019cvpr-fast,
title = {{Fast Interactive Object Annotation with Curve-GCN}},
author = {Ling, Huan and Gao, Jun and Kar, Amlan and Chen, Wenzheng and Fidler, Sanja},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2019},
doi = {10.1109/CVPR.2019.00540},
url = {https://mlanthology.org/cvpr/2019/ling2019cvpr-fast/}
}