P-Frame Coding Proposal by NCTU: Parametric Video Prediction Through Backprop-Based Motion Estimation
Abstract
This paper presents a parametric video prediction scheme with backprop-based motion estimation, in response to the CLIC challenge on P-frame compression. Recognizing that most learning-based video codecs rely on optical flow-based temporal prediction and suffer from having to signal a large amount of motion information, we propose to perform parametric overlapped block motion compensation on a sparse motion field. In forming this sparse motion field, we conduct the steepest descent algorithm on a loss function for identifying critical pixels, of which the motion vectors are communicated to the decoder. Moreover, we introduce a critical pixel dropout mechanism to strike a good balance between motion overhead and prediction quality. Compression results with HEVC-based residual coding on CLIC validation sequences show that our parametric video prediction achieves higher PSNR and MS-SSIM than optical flow-based warping. Moreover, our critical pixel dropout mechanism is found beneficial in terms of rate-distortion performance. Our scheme offers the potential for working with learned residual coding.
Cite
Text
Ho et al. "P-Frame Coding Proposal by NCTU: Parametric Video Prediction Through Backprop-Based Motion Estimation." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2020. doi:10.1109/CVPRW50498.2020.00083Markdown
[Ho et al. "P-Frame Coding Proposal by NCTU: Parametric Video Prediction Through Backprop-Based Motion Estimation." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2020.](https://mlanthology.org/cvprw/2020/ho2020cvprw-pframe/) doi:10.1109/CVPRW50498.2020.00083BibTeX
@inproceedings{ho2020cvprw-pframe,
title = {{P-Frame Coding Proposal by NCTU: Parametric Video Prediction Through Backprop-Based Motion Estimation}},
author = {Ho, Yung-Han and Chan, Chih-Chun and Alexandre, David and Peng, Wen-Hsiao and Chang, Chih-Peng},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2020},
pages = {598-601},
doi = {10.1109/CVPRW50498.2020.00083},
url = {https://mlanthology.org/cvprw/2020/ho2020cvprw-pframe/}
}