Controllable Video Generation Through Global and Local Motion Dynamics

Abstract

We present GLASS, a method for Global and Local Action-driven Sequence Synthesis. GLASS is a generative model that is trained on video sequences in an unsupervised manner and that can animate an input image at test time. The method learns to segment frames into foreground-background layers and to generate transitions of the foregrounds over time through a global and local action representation. Global actions are explicitly related to 2D shifts, while local actions are instead related to (both geometric and photometric) local deformations. GLASS uses a recurrent neural network to transition between frames and is trained through a reconstruction loss. We also introduce W-Sprites (Walking Sprites), a novel synthetic dataset with a predefined action space. We evaluate our method on both W-Sprites and real datasets, and find that GLASS is able to generate realistic video sequences from a single input image and to successfully learn a more advanced action space than in prior work. Further details, the code and example videos are available at https://araachie.github.io/glass/.

Cite

Text

Davtyan and Favaro. "Controllable Video Generation Through Global and Local Motion Dynamics." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19790-1_5

Markdown

[Davtyan and Favaro. "Controllable Video Generation Through Global and Local Motion Dynamics." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/davtyan2022eccv-controllable/) doi:10.1007/978-3-031-19790-1_5

BibTeX

@inproceedings{davtyan2022eccv-controllable,
  title     = {{Controllable Video Generation Through Global and Local Motion Dynamics}},
  author    = {Davtyan, Aram and Favaro, Paolo},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-19790-1_5},
  url       = {https://mlanthology.org/eccv/2022/davtyan2022eccv-controllable/}
}