FasterVD: On Acceleration of Video Diffusion Models

Yu, Pinrui; Luo, Dan; Rupprecht, Timothy; Lu, Lei; Kong, Zhenglun; Zhao, Pu; Li, Yanyu; Camps, Octavia I.; Lin, Xue; Wang, Yanzhi

doi:10.24963/ijcai.2024/1044

FasterVD: On Acceleration of Video Diffusion Models

Pinrui Yu, Dan Luo, Timothy Rupprecht, Lei Lu, Zhenglun Kong, Pu Zhao, Yanyu Li, Octavia I. Camps, Xue Lin, Yanzhi Wang

IJCAI 2024 pp. 8838-8842

doi:10.24963/ijcai.2024/1044 /ijcai/2024/yu2024ijcai-fastervd/

Abstract

The deep operator networks (DON), a class of neural operators that learn mappings between function spaces, have recently emerged as surrogate models for parametric partial differential equations (PDEs). However, their full potential for accurately approximating general black-box PDEs remains underexplored due to challenges in training stability and performance, primarily arising from difficulties in learning mappings between low-dimensional inputs and high-dimensional outputs. Furthermore, inadequate encoding of input functions and query positions limits the generalization ability of DONs. To address these challenges, we propose the Dynamical Coupled Operator (DCO), which incorporates temporal dynamics to learn coupled functions, reducing information loss and improving training robustness. Additionally, we introduce an adaptive spectral input function encoder based on empirical mode decomposition to enhance input function representation, as well as a hybrid location encoder to improve query location encoding. We provide theoretical guarantees on the universal expressiveness of DCO, ensuring its applicability to a wide range of PDE problems. Extensive experiments on real-world, high-dimensional PDE datasets demonstrate that DCO significantly outperforms DONs.

PDF IJCAI Semantic Scholar

Cite

Text

Yu et al. "FasterVD: On Acceleration of Video Diffusion Models." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/1044

Markdown

[Yu et al. "FasterVD: On Acceleration of Video Diffusion Models." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/yu2024ijcai-fastervd/) doi:10.24963/ijcai.2024/1044

BibTeX

@inproceedings{yu2024ijcai-fastervd,
  title     = {{FasterVD: On Acceleration of Video Diffusion Models}},
  author    = {Yu, Pinrui and Luo, Dan and Rupprecht, Timothy and Lu, Lei and Kong, Zhenglun and Zhao, Pu and Li, Yanyu and Camps, Octavia I. and Lin, Xue and Wang, Yanzhi},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {8838-8842},
  doi       = {10.24963/ijcai.2024/1044},
  url       = {https://mlanthology.org/ijcai/2024/yu2024ijcai-fastervd/}
}