Modelling Complex Vector Drawings with Stroke-Clouds

Abstract

Vector drawings are innately interactive as they preserve creational cues. Despite this desirable property they remain relatively under explored due to the difficulties in modeling complex vector drawings. This is in part due to the primarily _sequential and auto-regressive nature_ of existing approaches failing to scale beyond simple drawings. In this paper, we define generative models over _highly complex_ vector drawings by first representing them as “stroke-clouds” – _sets_ of arbitrary cardinality comprised of semantically meaningful strokes. The dimensionality of the strokes is a design choice that allows the model to adapt to a range of complexities. We learn to encode these _set of strokes_ into compact latent codes by a probabilistic reconstruction procedure backed by _De-Finetti’s Theorem of Exchangability_. The parametric generative model is then defined over the latent vectors of the encoded stroke-clouds. The resulting “Latent stroke-cloud generator (LSG)” thus captures the distribution of complex vector drawings on an implicit _set space_. We demonstrate the efficacy of our model on complex drawings (a newly created Anime line-art dataset) through a range of generative tasks.

Cite

Text

Ashcroft et al. "Modelling Complex Vector Drawings with Stroke-Clouds." International Conference on Learning Representations, 2024.

Markdown

[Ashcroft et al. "Modelling Complex Vector Drawings with Stroke-Clouds." International Conference on Learning Representations, 2024.](https://mlanthology.org/iclr/2024/ashcroft2024iclr-modelling/)

BibTeX

@inproceedings{ashcroft2024iclr-modelling,
  title     = {{Modelling Complex Vector Drawings with Stroke-Clouds}},
  author    = {Ashcroft, Alexander and Das, Ayan and Gryaditskaya, Yulia and Qu, Zhiyu and Song, Yi-Zhe},
  booktitle = {International Conference on Learning Representations},
  year      = {2024},
  url       = {https://mlanthology.org/iclr/2024/ashcroft2024iclr-modelling/}
}