Enhanced Training of Query-Based Object Detection via Selective Query Recollection

Abstract

This paper investigates a phenomenon where query-based object detectors mispredict at the last decoding stage while predicting correctly at an intermediate stage. We review the training process and attribute the overlooked phenomenon to two limitations: lack of training emphasis and cascading errors from decoding sequence. We design and present Selective Query Recollection (SQR), a simple and effective training strategy for query-based object detectors. It cumulatively collects intermediate queries as decoding stages go deeper and selectively forwards the queries to the downstream stages aside from the sequential structure. Such-wise, SQR places training emphasis on later stages and allows later stages to work with intermediate queries from earlier stages directly. SQR can be easily plugged into various query-based object detectors and significantly enhances their performance while leaving the inference pipeline unchanged. As a result, we apply SQR on Adamixer, DAB-DETR, and Deformable-DETR across various settings (backbone, number of queries, schedule) and consistently brings 1.4 2.8 AP improvement.

Cite

Text

Chen et al. "Enhanced Training of Query-Based Object Detection via Selective Query Recollection." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.02275

Markdown

[Chen et al. "Enhanced Training of Query-Based Object Detection via Selective Query Recollection." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/chen2023cvpr-enhanced/) doi:10.1109/CVPR52729.2023.02275

BibTeX

@inproceedings{chen2023cvpr-enhanced,
  title     = {{Enhanced Training of Query-Based Object Detection via Selective Query Recollection}},
  author    = {Chen, Fangyi and Zhang, Han and Hu, Kai and Huang, Yu-Kai and Zhu, Chenchen and Savvides, Marios},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2023},
  pages     = {23756-23765},
  doi       = {10.1109/CVPR52729.2023.02275},
  url       = {https://mlanthology.org/cvpr/2023/chen2023cvpr-enhanced/}
}