Benchmark Dataset and Effective Inter-Frame Alignment for Real-World Video Super-Resolution

Abstract

Video super-resolution (VSR) aiming to reconstruct a high-resolution (HR) video from its low-resolution (LR) counterpart has made tremendous progress in recent years. However, it remains challenging to deploy existing VSR methods to real-world data with complex degradations. On the one hand, there are few well-aligned real-world VSR datasets, especially with large super-resolution scale factors, which limits the development of real-world VSR tasks. On the other hand, alignment algorithms in existing VSR methods perform poorly for real-world videos, leading to unsatisfactory results. As an attempt to address the aforementioned issues, we build a real-world ×4 VSR dataset, namely MVSR4×, where low- and high-resolution videos are captured with different focal length lenses of a smartphone, respectively. Moreover, we propose an effective alignment method for real-world VSR, namely EAVSR. EAVSR takes the proposed multi-layer adaptive spatial transform network (MultiAdaSTN) to refine the offsets provided by the pre-trained optical flow estimation network. Experimental results on RealVSR and MVSR4× datasets show the effectiveness and practicality of our method, and we achieve state-of-the-art performance in real-world VSR task. The dataset and code will be available at https://github.com/HITRainer/EAVSR.

Cite

Text

Wang et al. "Benchmark Dataset and Effective Inter-Frame Alignment for Real-World Video Super-Resolution." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2023. doi:10.1109/CVPRW59228.2023.00124

Markdown

[Wang et al. "Benchmark Dataset and Effective Inter-Frame Alignment for Real-World Video Super-Resolution." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2023.](https://mlanthology.org/cvprw/2023/wang2023cvprw-benchmark/) doi:10.1109/CVPRW59228.2023.00124

BibTeX

@inproceedings{wang2023cvprw-benchmark,
  title     = {{Benchmark Dataset and Effective Inter-Frame Alignment for Real-World Video Super-Resolution}},
  author    = {Wang, Ruohao and Liu, Xiaohui and Zhang, Zhilu and Wu, Xiaohe and Feng, Chun-Mei and Zhang, Lei and Zuo, Wangmeng},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2023},
  pages     = {1168-1177},
  doi       = {10.1109/CVPRW59228.2023.00124},
  url       = {https://mlanthology.org/cvprw/2023/wang2023cvprw-benchmark/}
}