Leveraging Unlabeled Data for Sketch-Based Understanding

Abstract

Sketch-based understanding is a critical component of human cognitive learning and is a primitive communication means between humans. This topic has recently attracted the interest of the computer vision community as sketching represents a powerful tool to express static objects and dynamic scenes. Unfortunately, despite its broad application domains, the current sketch-based models strongly rely on labels for supervised training, ignoring knowledge from unlabeled data, thus limiting the underlying generalization and the applicability. Therefore, we present a study about the use of unlabeled data to improve a sketch-based model. To this end, we evaluate variations of VAE and semi-supervised VAE, and present an extension of BYOL to deal with sketches. Our results show the superiority of sketch-BYOL, which outperforms other self-supervised approaches increasing the retrieval performance for known and unknown categories. Furthermore, we show how other tasks can benefit from our proposal.

Cite

Text

Morales et al. "Leveraging Unlabeled Data for Sketch-Based Understanding." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022. doi:10.1109/CVPRW56347.2022.00563

Markdown

[Morales et al. "Leveraging Unlabeled Data for Sketch-Based Understanding." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022.](https://mlanthology.org/cvprw/2022/morales2022cvprw-leveraging/) doi:10.1109/CVPRW56347.2022.00563

BibTeX

@inproceedings{morales2022cvprw-leveraging,
  title     = {{Leveraging Unlabeled Data for Sketch-Based Understanding}},
  author    = {Morales, Javier and Murrugarra-Llerena, Nils and Saavedra, José M.},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2022},
  pages     = {5149-5158},
  doi       = {10.1109/CVPRW56347.2022.00563},
  url       = {https://mlanthology.org/cvprw/2022/morales2022cvprw-leveraging/}
}