SketchINR: A First Look into Sketches as Implicit Neural Representations
Abstract
We propose SketchINR to advance the representation of vector sketches with implicit neural models. A variable length vector sketch is compressed into a latent space of fixed dimension that implicitly encodes the underlying shape as a function of time and strokes. The learned function predicts the xy point coordinates in a sketch at each time and stroke. Despite its simplicity SketchINR outperforms existing representations at multiple tasks: (i) Encoding an entire sketch dataset into a fixed size latent vector SketchINR gives 60x and 10x data compression over raster and vector sketches respectively. (ii) SketchINR's auto-decoder provides a much higher-fidelity representation than other learned vector sketch representations and is uniquely able to scale to complex vector sketches such as FS-COCO. (iii) SketchINR supports parallelisation that can decode/render 100x faster than other learned vector representations such as SketchRNN. (iv) SketchINR for the first time emulates the human ability to reproduce a sketch with varying abstraction in terms of number and complexity of strokes. As a first look at implicit sketches SketchINR's compact high-fidelity representation will support future work in modelling long and complex sketches.
Cite
Text
Bandyopadhyay et al. "SketchINR: A First Look into Sketches as Implicit Neural Representations." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.01194Markdown
[Bandyopadhyay et al. "SketchINR: A First Look into Sketches as Implicit Neural Representations." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/bandyopadhyay2024cvpr-sketchinr/) doi:10.1109/CVPR52733.2024.01194BibTeX
@inproceedings{bandyopadhyay2024cvpr-sketchinr,
title = {{SketchINR: A First Look into Sketches as Implicit Neural Representations}},
author = {Bandyopadhyay, Hmrishav and Bhunia, Ayan Kumar and Chowdhury, Pinaki Nath and Sain, Aneeshan and Xiang, Tao and Hospedales, Timothy and Song, Yi-Zhe},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2024},
pages = {12565-12574},
doi = {10.1109/CVPR52733.2024.01194},
url = {https://mlanthology.org/cvpr/2024/bandyopadhyay2024cvpr-sketchinr/}
}