Time-rEversed diffusioN tEnsor Transformer: A New TENET of Few-Shot Object Detection

Abstract

In this paper, we tackle the challenging problem of Few-shot Object Detection. Existing FSOD pipelines (i) use average-pooled representations that result in information loss; and/or (ii) discard position information that can help detect object instances. Consequently, such pipelines are sensitive to large intra-class appearance and geometric variations between support and query images. To address these drawbacks, we propose a Time-rEversed diffusioN tEnsor Transformer (TENET), which i) forms high-order tensor representations that capture multi-way feature occurrences that are highly discriminative, and ii) uses a transformer that dynamically extracts correlations between the query image and the entire support set, instead of a single average-pooled support embedding. We also propose a Transformer Relation Head (TRH), equipped with higher-order representations, which encodes correlations between query regions and the entire support set, while being sensitive to the positional variability of object instances. Our model achieves state-of-the-art results on PASCAL VOC, FSOD, and COCO.

Cite

Text

Zhang et al. "Time-rEversed diffusioN tEnsor Transformer: A New TENET of Few-Shot Object Detection." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-20044-1_18

Markdown

[Zhang et al. "Time-rEversed diffusioN tEnsor Transformer: A New TENET of Few-Shot Object Detection." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/zhang2022eccv-timereversed/) doi:10.1007/978-3-031-20044-1_18

BibTeX

@inproceedings{zhang2022eccv-timereversed,
  title     = {{Time-rEversed diffusioN tEnsor Transformer: A New TENET of Few-Shot Object Detection}},
  author    = {Zhang, Shan and Murray, Naila and Wang, Lei and Koniusz, Piotr},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-20044-1_18},
  url       = {https://mlanthology.org/eccv/2022/zhang2022eccv-timereversed/}
}