Temporal Coherency Based Criteria for Predicting Video Frames Using Deep Multi-Stage Generative Adversarial Networks

Abstract

Predicting the future from a sequence of video frames has been recently a sought after yet challenging task in the field of computer vision and machine learning. Although there have been efforts for tracking using motion trajectories and flow features, the complex problem of generating unseen frames has not been studied extensively. In this paper, we deal with this problem using convolutional models within a multi-stage Generative Adversarial Networks (GAN) framework. The proposed method uses two stages of GANs to generate a crisp and clear set of future frames. Although GANs have been used in the past for predicting the future, none of the works consider the relation between subsequent frames in the temporal dimension. Our main contribution lies in formulating two objective functions based on the Normalized Cross Correlation (NCC) and the Pairwise Contrastive Divergence (PCD) for solving this problem. This method, coupled with the traditional L1 loss, has been experimented with three real-world video datasets, viz. Sports-1M, UCF-101 and the KITTI. Performance analysis reveals superior results over the recent state-of-the-art methods.

Cite

Text

Bhattacharjee and Das. "Temporal Coherency Based Criteria for Predicting Video Frames Using Deep Multi-Stage Generative Adversarial Networks." Neural Information Processing Systems, 2017.

Markdown

[Bhattacharjee and Das. "Temporal Coherency Based Criteria for Predicting Video Frames Using Deep Multi-Stage Generative Adversarial Networks." Neural Information Processing Systems, 2017.](https://mlanthology.org/neurips/2017/bhattacharjee2017neurips-temporal/)

BibTeX

@inproceedings{bhattacharjee2017neurips-temporal,
  title     = {{Temporal Coherency Based Criteria for Predicting Video Frames Using Deep Multi-Stage Generative Adversarial Networks}},
  author    = {Bhattacharjee, Prateep and Das, Sukhendu},
  booktitle = {Neural Information Processing Systems},
  year      = {2017},
  pages     = {4268-4277},
  url       = {https://mlanthology.org/neurips/2017/bhattacharjee2017neurips-temporal/}
}