Open-Vocabulary Attribute Detection

Bravo, María A.; Mittal, Sudhanshu; Ging, Simon; Brox, Thomas

doi:10.1109/CVPR52729.2023.00680

Open-Vocabulary Attribute Detection

María A. Bravo, Sudhanshu Mittal, Simon Ging, Thomas Brox

CVPR 2023 pp. 7041-7050

doi:10.1109/CVPR52729.2023.00680 /cvpr/2023/bravo2023cvpr-openvocabulary/

Abstract

Vision-language modeling has enabled open-vocabulary tasks where predictions can be queried using any text prompt in a zero-shot manner. Existing open-vocabulary tasks focus on object classes, whereas research on object attributes is limited due to the lack of a reliable attribute-focused evaluation benchmark. This paper introduces the Open-Vocabulary Attribute Detection (OVAD) task and the corresponding OVAD benchmark. The objective of the novel task and benchmark is to probe object-level attribute information learned by vision-language models. To this end, we created a clean and densely annotated test set covering 117 attribute classes on the 80 object classes of MS COCO. It includes positive and negative annotations, which enables open-vocabulary evaluation. Overall, the benchmark consists of 1.4 million annotations. For reference, we provide a first baseline method for open-vocabulary attribute detection. Moreover, we demonstrate the benchmark's value by studying the attribute detection performance of several foundation models.

PDF CVPR Semantic Scholar

Cite

Text

Bravo et al. "Open-Vocabulary Attribute Detection." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.00680

Markdown

[Bravo et al. "Open-Vocabulary Attribute Detection." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/bravo2023cvpr-openvocabulary/) doi:10.1109/CVPR52729.2023.00680

BibTeX

@inproceedings{bravo2023cvpr-openvocabulary,
  title     = {{Open-Vocabulary Attribute Detection}},
  author    = {Bravo, María A. and Mittal, Sudhanshu and Ging, Simon and Brox, Thomas},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2023},
  pages     = {7041-7050},
  doi       = {10.1109/CVPR52729.2023.00680},
  url       = {https://mlanthology.org/cvpr/2023/bravo2023cvpr-openvocabulary/}
}