Image-Level or Object-Level? a Tale of Two Resampling Strategies for Long-Tailed Detection
Abstract
Training on datasets with long-tailed distributions has been challenging for major recognition tasks such as classification and detection. To deal with this challenge, image resampling is typically introduced as a simple but effective approach. However, we observe that long-tailed detection differs from classification since multiple classes may be present in one image. As a result, image resampling alone is not enough to yield a sufficiently balanced distribution at the object-level. We address object-level resampling by introducing an object-centric sampling strategy based on a dynamic, episodic memory bank. Our proposed strategy has two benefits: 1) convenient object-level resampling without significant extra computation, and 2) implicit feature-level augmentation from model updates. We show that image-level and object-level resamplings are both important, and thus unify them with a joint resampling strategy. Our method achieves state-of-the-art performance on the rare categories of LVIS, with 1.89% and 3.13% relative improvements over Forest R-CNN on detection and instance segmentation.
Cite
Text
Chang et al. "Image-Level or Object-Level? a Tale of Two Resampling Strategies for Long-Tailed Detection." International Conference on Machine Learning, 2021.Markdown
[Chang et al. "Image-Level or Object-Level? a Tale of Two Resampling Strategies for Long-Tailed Detection." International Conference on Machine Learning, 2021.](https://mlanthology.org/icml/2021/chang2021icml-imagelevel/)BibTeX
@inproceedings{chang2021icml-imagelevel,
title = {{Image-Level or Object-Level? a Tale of Two Resampling Strategies for Long-Tailed Detection}},
author = {Chang, Nadine and Yu, Zhiding and Wang, Yu-Xiong and Anandkumar, Animashree and Fidler, Sanja and Alvarez, Jose M},
booktitle = {International Conference on Machine Learning},
year = {2021},
pages = {1463-1472},
volume = {139},
url = {https://mlanthology.org/icml/2021/chang2021icml-imagelevel/}
}