DFDNet: Disentangling and Filtering Dynamics for Enhanced Video Prediction

Abstract

Videos inherently contain complex temporal dynamics across various spatial directions, often entangled in ways that obscure effective dynamic extraction. Previous studies typically process video spatiotemporal features without disentangling, which hampers their ability to extract dynamic information. Additionally, the extraction of dynamics is disrupted by transient high-dynamic information in video sequences, e.g., noise or flicker, which has received limited attention in the literature. To tackle those problems, this paper proposes the Disentangling and Filtering Dynamics Network (DFDNet). Firstly, to disentangle the interwoven dynamics, DFDNet decomposes the spatially encoded video sequences into lower dimensional sequences. Secondly, a learnable threshold filter is proposed to eliminate the transient high-dynamic information. Thirdly, the model incorporates an MLP to extract the temporal dependencies from the disentangled and filtered sequences. DFDNet demonstrates competitive performance across four chosen datasets, including both low and high-resolution videos. Specifically, on the low-resolution Moving MNIST dataset, DFDNet achieves a 19% improvement on MSE over the previous state-of-the-art model. On the high-resolution SJTU4K dataset, it outperforms the previous state-of-the-art model by 10% on the LPIPS metric under similar inference time.

Cite

Text

Gan et al. "DFDNet: Disentangling and Filtering Dynamics for Enhanced Video Prediction." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I3.32314

Markdown

[Gan et al. "DFDNet: Disentangling and Filtering Dynamics for Enhanced Video Prediction." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/gan2025aaai-dfdnet/) doi:10.1609/AAAI.V39I3.32314

BibTeX

@inproceedings{gan2025aaai-dfdnet,
  title     = {{DFDNet: Disentangling and Filtering Dynamics for Enhanced Video Prediction}},
  author    = {Gan, Lianqiang and Lai, Junyu and Ju, Jingze and Gao, Lianli and Bin, Yi},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {3059-3067},
  doi       = {10.1609/AAAI.V39I3.32314},
  url       = {https://mlanthology.org/aaai/2025/gan2025aaai-dfdnet/}
}