Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation

Zhang, Guozhen; Zhu, Yuhan; Wang, Haonan; Chen, Youxin; Wu, Gangshan; Wang, Limin

doi:10.1109/CVPR52729.2023.00550

Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation

Guozhen Zhang, Yuhan Zhu, Haonan Wang, Youxin Chen, Gangshan Wu, Limin Wang

CVPR 2023 pp. 5682-5692

doi:10.1109/CVPR52729.2023.00550 /cvpr/2023/zhang2023cvpr-extracting/

Abstract

Effectively extracting inter-frame motion and appearance information is important for video frame interpolation (VFI). Previous works either extract both types of information in a mixed way or devise separate modules for each type of information, which lead to representation ambiguity and low efficiency. In this paper, we propose a new module to explicitly extract motion and appearance information via a unified operation. Specifically, we rethink the information process in inter-frame attention and reuse its attention map for both appearance feature enhancement and motion information extraction. Furthermore, for efficient VFI, our proposed module could be seamlessly integrated into a hybrid CNN and Transformer architecture. This hybrid pipeline can alleviate the computational complexity of inter-frame attention as well as preserve detailed low-level structure information. Experimental results demonstrate that, for both fixed- and arbitrary-timestep interpolation, our method achieves state-of-the-art performance on various datasets. Meanwhile, our approach enjoys a lighter computation overhead over models with close performance. The source code and models are available at https://github.com/MCG-NJU/EMA-VFI.

PDF CVPR Semantic Scholar

Cite

Text

Zhang et al. "Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.00550

Markdown

[Zhang et al. "Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/zhang2023cvpr-extracting/) doi:10.1109/CVPR52729.2023.00550

BibTeX

@inproceedings{zhang2023cvpr-extracting,
  title     = {{Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation}},
  author    = {Zhang, Guozhen and Zhu, Yuhan and Wang, Haonan and Chen, Youxin and Wu, Gangshan and Wang, Limin},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2023},
  pages     = {5682-5692},
  doi       = {10.1109/CVPR52729.2023.00550},
  url       = {https://mlanthology.org/cvpr/2023/zhang2023cvpr-extracting/}
}