Simple Multi-Dataset Detection

Abstract

How do we build a general and broad object detection system? We use all labels of all concepts ever annotated. These labels span diverse datasets with potentially inconsistent taxonomies. In this paper, we present a simple method for training a unified detector on multiple large-scale datasets. We use dataset-specific training protocols and losses, but share a common detection architecture with dataset-specific outputs. We show how to automatically integrate these dataset-specific outputs into a common semantic taxonomy. In contrast to prior work, our approach does not require manual taxonomy reconciliation. Experiments show our learned taxonomy outperforms a expert-designed taxonomy in all datasets. Our multi-dataset detector performs as well as dataset-specific models on each training domain, and can generalize to new unseen dataset without fine-tuning on them. Code is available at https://github.com/xingyizhou/UniDet.

Cite

Text

Zhou et al. "Simple Multi-Dataset Detection." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.00742

Markdown

[Zhou et al. "Simple Multi-Dataset Detection." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/zhou2022cvpr-simple/) doi:10.1109/CVPR52688.2022.00742

BibTeX

@inproceedings{zhou2022cvpr-simple,
  title     = {{Simple Multi-Dataset Detection}},
  author    = {Zhou, Xingyi and Koltun, Vladlen and Krähenbühl, Philipp},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
  pages     = {7571-7580},
  doi       = {10.1109/CVPR52688.2022.00742},
  url       = {https://mlanthology.org/cvpr/2022/zhou2022cvpr-simple/}
}