Leveraging Unlabeled Data for Sketch-Based Understanding
Abstract
Sketch-based understanding is a critical component of human cognitive learning and is a primitive communication means between humans. This topic has recently attracted the interest of the computer vision community as sketching represents a powerful tool to express static objects and dynamic scenes. Unfortunately, despite its broad application domains, the current sketch-based models strongly rely on labels for supervised training, ignoring knowledge from unlabeled data, thus limiting the underlying generalization and the applicability. Therefore, we present a study about the use of unlabeled data to improve a sketch-based model. To this end, we evaluate variations of VAE and semi-supervised VAE, and present an extension of BYOL to deal with sketches. Our results show the superiority of sketch-BYOL, which outperforms other self-supervised approaches increasing the retrieval performance for known and unknown categories. Furthermore, we show how other tasks can benefit from our proposal.
Cite
Text
Morales et al. "Leveraging Unlabeled Data for Sketch-Based Understanding." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022. doi:10.1109/CVPRW56347.2022.00563Markdown
[Morales et al. "Leveraging Unlabeled Data for Sketch-Based Understanding." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022.](https://mlanthology.org/cvprw/2022/morales2022cvprw-leveraging/) doi:10.1109/CVPRW56347.2022.00563BibTeX
@inproceedings{morales2022cvprw-leveraging,
title = {{Leveraging Unlabeled Data for Sketch-Based Understanding}},
author = {Morales, Javier and Murrugarra-Llerena, Nils and Saavedra, José M.},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2022},
pages = {5149-5158},
doi = {10.1109/CVPRW56347.2022.00563},
url = {https://mlanthology.org/cvprw/2022/morales2022cvprw-leveraging/}
}