Generating Synthetic Video Sequences by Explicitly Modeling Object Motion

Abstract

Recent GAN-based video generation approaches model videos as the combination of a time-independent scene component and a time-varying motion component, thus factorizing the generation problem into generating background and foreground separately. One of the main limitations of current approaches is that both factors are learned by mapping one source latent space to videos, which complicates the generation task as a single data point must be informative of both background and foreground content. In this paper we propose a GAN framework for video generation that, instead, employs two latent spaces in order to structure the generative process in a more natural way: (1) a latent space to generate the static visual content of a scene (background), which remains the same for the whole video, and (2) a latent space where motion is encoded as a trajectory between sampled points and whose dynamics are modeled through an RNN encoder (jointly trained with the generator and the discriminator) and then mapped by the generator to visual objects’ motion. Performance evaluation showed that our approach is able to control effectively the generation process as well as to synthesize more realistic videos than state-of-the-art methods.

Cite

Text

Palazzo et al. "Generating Synthetic Video Sequences by Explicitly Modeling Object Motion." European Conference on Computer Vision Workshops, 2018. doi:10.1007/978-3-030-11012-3_37

Markdown

[Palazzo et al. "Generating Synthetic Video Sequences by Explicitly Modeling Object Motion." European Conference on Computer Vision Workshops, 2018.](https://mlanthology.org/eccvw/2018/palazzo2018eccvw-generating/) doi:10.1007/978-3-030-11012-3_37

BibTeX

@inproceedings{palazzo2018eccvw-generating,
  title     = {{Generating Synthetic Video Sequences by Explicitly Modeling Object Motion}},
  author    = {Palazzo, Simone and Spampinato, Concetto and D'Oro, P. and Giordano, Daniela and Shah, Mubarak},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2018},
  pages     = {492-499},
  doi       = {10.1007/978-3-030-11012-3_37},
  url       = {https://mlanthology.org/eccvw/2018/palazzo2018eccvw-generating/}
}