Domain-Smoothing Network for Zero-Shot Sketch-Based Image Retrieval
Abstract
Zero-Shot Sketch-Based Image Retrieval (ZS-SBIR) is a novel cross-modal retrieval task, where abstract sketches are used as queries to retrieve natural images under zero-shot scenario. Most existing methods regard ZS-SBIR as a traditional classification problem and employ a cross-entropy or triplet-based loss to achieve retrieval, which neglect the problems of the domain gap between sketches and natural images and the large intra-class diversity in sketches. Toward this end, we propose a novel Domain-Smoothing Network (DSN) for ZS-SBIR. Specifically, a cross-modal contrastive method is proposed to learn generalized representations to smooth the domain gap by mining relations with additional augmented samples. Furthermore, a category-specific memory bank with sketch features is explored to reduce intra-class diversity in the sketch domain. Extensive experiments demonstrate that our approach notably outperforms the state-of-the-art methods in both Sketchy and TU-Berlin datasets.
Cite
Text
Wang et al. "Domain-Smoothing Network for Zero-Shot Sketch-Based Image Retrieval." International Joint Conference on Artificial Intelligence, 2021. doi:10.24963/IJCAI.2021/158Markdown
[Wang et al. "Domain-Smoothing Network for Zero-Shot Sketch-Based Image Retrieval." International Joint Conference on Artificial Intelligence, 2021.](https://mlanthology.org/ijcai/2021/wang2021ijcai-domain/) doi:10.24963/IJCAI.2021/158BibTeX
@inproceedings{wang2021ijcai-domain,
title = {{Domain-Smoothing Network for Zero-Shot Sketch-Based Image Retrieval}},
author = {Wang, Zhipeng and Wang, Hao and Yan, Jiexi and Wu, Aming and Deng, Cheng},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2021},
pages = {1143-1149},
doi = {10.24963/IJCAI.2021/158},
url = {https://mlanthology.org/ijcai/2021/wang2021ijcai-domain/}
}