Recurrent Back-Projection Network for Video Super-Resolution
Abstract
We proposed a novel architecture for the problem of video super-resolution. We integrate spatial and temporal contexts from continuous video frames using a recurrent encoder-decoder module, that fuses multi-frame information with the more traditional, single frame super-resolution path for the target frame. In contrast to most prior work where frames are pooled together by stacking or warping, our model, the Recurrent Back-Projection Network (RBPN) treats each context frame as a separate source of information. These sources are combined in an iterative refinement framework inspired by the idea of back-projection in multiple-image super-resolution. This is aided by explicitly representing estimated inter-frame motion with respect to the target, rather than explicitly aligning frames. We propose a new video super-resolution benchmark, allowing evaluation at a larger scale and considering videos in different motion regimes. Experimental results demonstrate that our RBPN is superior to existing methods on several datasets.
Cite
Text
Haris et al. "Recurrent Back-Projection Network for Video Super-Resolution." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. doi:10.1109/CVPR.2019.00402Markdown
[Haris et al. "Recurrent Back-Projection Network for Video Super-Resolution." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.](https://mlanthology.org/cvpr/2019/haris2019cvpr-recurrent/) doi:10.1109/CVPR.2019.00402BibTeX
@inproceedings{haris2019cvpr-recurrent,
title = {{Recurrent Back-Projection Network for Video Super-Resolution}},
author = {Haris, Muhammad and Shakhnarovich, Gregory and Ukita, Norimichi},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2019},
doi = {10.1109/CVPR.2019.00402},
url = {https://mlanthology.org/cvpr/2019/haris2019cvpr-recurrent/}
}