Enhancing Video Super-Resolution via Implicit Resampling-Based Alignment

Kai Xu, Ziwei Yu, Xin Wang, Michael Bi Mi, Angela Yao

CVPR 2024 pp. 2546-2555

doi:10.1109/CVPR52733.2024.00246 /cvpr/2024/xu2024cvpr-enhancing/

Abstract

In video super-resolution it is common to use a frame-wise alignment to support the propagation of information over time. The role of alignment is well-studied for low-level enhancement in video but existing works overlook a critical step -- resampling. We show through extensive experiments that for alignment to be effective the resampling should preserve the reference frequency spectrum while minimizing spatial distortions. However most existing works simply use a default choice of bilinear interpolation for resampling even though bilinear interpolation has a smoothing effect and hinders super-resolution. From these observations we propose an implicit resampling-based alignment. The sampling positions are encoded by a sinusoidal positional encoding while the value is estimated with a coordinate network and a window-based cross-attention. We show that bilinear interpolation inherently attenuates high-frequency information while an MLP-based coordinate network can approximate more frequencies. Experiments on synthetic and real-world datasets show that alignment with our proposed implicit resampling enhances the performance of state-of-the-art frameworks with minimal impact on both compute and parameters.

PDF CVPR Semantic Scholar

Cite

Text

Xu et al. "Enhancing Video Super-Resolution via Implicit Resampling-Based Alignment." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.00246

Markdown

[Xu et al. "Enhancing Video Super-Resolution via Implicit Resampling-Based Alignment." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/xu2024cvpr-enhancing/) doi:10.1109/CVPR52733.2024.00246

BibTeX

@inproceedings{xu2024cvpr-enhancing,
  title     = {{Enhancing Video Super-Resolution via Implicit Resampling-Based Alignment}},
  author    = {Xu, Kai and Yu, Ziwei and Wang, Xin and Mi, Michael Bi and Yao, Angela},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {2546-2555},
  doi       = {10.1109/CVPR52733.2024.00246},
  url       = {https://mlanthology.org/cvpr/2024/xu2024cvpr-enhancing/}
}