BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment

Abstract

A recurrent structure is a popular framework choice for the task of video super-resolution. The state-of-the-art method BasicVSR adopts bidirectional propagation with feature alignment to effectively exploit information from the entire input video. In this study, we redesign BasicVSR by proposing second-order grid propagation and flow-guided deformable alignment. We show that by empowering the recurrent framework with enhanced propagation and alignment, one can exploit spatiotemporal information across misaligned video frames more effectively. The new components lead to an improved performance under a similar computational constraint. In particular, our model BasicVSR++ surpasses BasicVSR by a significant 0.82 dB in PSNR with similar number of parameters. BasicVSR++ is generalizable to other video restoration tasks, and obtains three champions and one first runner-up in NTIRE 2021 video restoration challenge.

Cite

Text

Chan et al. "BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.00588

Markdown

[Chan et al. "BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/chan2022cvpr-basicvsr/) doi:10.1109/CVPR52688.2022.00588

BibTeX

@inproceedings{chan2022cvpr-basicvsr,
  title     = {{BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment}},
  author    = {Chan, Kelvin C.K. and Zhou, Shangchen and Xu, Xiangyu and Loy, Chen Change},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
  pages     = {5972-5981},
  doi       = {10.1109/CVPR52688.2022.00588},
  url       = {https://mlanthology.org/cvpr/2022/chan2022cvpr-basicvsr/}
}