Fine-Grained Dynamic Head for Object Detection

Abstract

The Feature Pyramid Network (FPN) presents a remarkable approach to alleviate the scale variance in object representation by performing instance-level assignments. Nevertheless, this strategy ignores the distinct characteristics of different sub-regions in an instance. To this end, we propose a fine-grained dynamic head to conditionally select a pixel-level combination of FPN features from different scales for each instance, which further releases the ability of multi-scale feature representation. Moreover, we design a spatial gate with the new activation function to reduce computational complexity dramatically through spatially sparse convolutions. Extensive experiments demonstrate the effectiveness and efficiency of the proposed method on several state-of-the-art detection benchmarks. Code is available at https://github.com/StevenGrove/DynamicHead.

Cite

Text

Song et al. "Fine-Grained Dynamic Head for Object Detection." Neural Information Processing Systems, 2020.

Markdown

[Song et al. "Fine-Grained Dynamic Head for Object Detection." Neural Information Processing Systems, 2020.](https://mlanthology.org/neurips/2020/song2020neurips-finegrained/)

BibTeX

@inproceedings{song2020neurips-finegrained,
  title     = {{Fine-Grained Dynamic Head for Object Detection}},
  author    = {Song, Lin and Li, Yanwei and Jiang, Zhengkai and Li, Zeming and Sun, Hongbin and Sun, Jian and Zheng, Nanning},
  booktitle = {Neural Information Processing Systems},
  year      = {2020},
  url       = {https://mlanthology.org/neurips/2020/song2020neurips-finegrained/}
}