Open-Vocabulary Object Detectors: Robustness Challenges Under Distribution Shifts
Abstract
The challenge of Out-Of-Distribution (OOD) robustness remains a critical hurdle towards deploying deep vision models. Vision-Language Models (VLMs) have recently achieved groundbreaking results. VLM-based open-vocabulary object detection extends the capabilities of traditional object detection frameworks, enabling the recognition and classification of objects beyond predefined categories. Investigating OOD robustness in recent open-vocabulary object detection is essential to increase the trustworthiness of these models. This study presents a comprehensive robustness evaluation of the zero-shot capabilities of three recent open-vocabulary (OV) foundation object detection models: OWL-ViT, YOLO World, and Grounding DINO. Experiments carried out on the robustness benchmarks COCO-O, COCO-DC, and COCO-C encompassing distribution shifts due to information loss, corruption, adversarial attacks, and geometrical deformation, highlighting the challenges of the model's robustness to foster the research for achieving robustness. Project page: https://prakashchhipa.github.io/projects/ovod_robustness
Cite
Text
Chhipa et al. "Open-Vocabulary Object Detectors: Robustness Challenges Under Distribution Shifts." European Conference on Computer Vision Workshops, 2024. doi:10.1007/978-3-031-91672-4_5Markdown
[Chhipa et al. "Open-Vocabulary Object Detectors: Robustness Challenges Under Distribution Shifts." European Conference on Computer Vision Workshops, 2024.](https://mlanthology.org/eccvw/2024/chhipa2024eccvw-openvocabulary/) doi:10.1007/978-3-031-91672-4_5BibTeX
@inproceedings{chhipa2024eccvw-openvocabulary,
title = {{Open-Vocabulary Object Detectors: Robustness Challenges Under Distribution Shifts}},
author = {Chhipa, Prakash Chandra and De, Kanjar and Chippa, Meenakshi Subhash and Saini, Rajkumar and Liwicki, Marcus},
booktitle = {European Conference on Computer Vision Workshops},
year = {2024},
pages = {62-79},
doi = {10.1007/978-3-031-91672-4_5},
url = {https://mlanthology.org/eccvw/2024/chhipa2024eccvw-openvocabulary/}
}