Learned Video Compression with Intra-Guided Enhancement and Implicit Motion Information

Abstract

Although learned approaches to video compression have been proposed with promising results, hand-engineered video codecs are still unbeaten. On the other hand, learned image compression has already surpassed traditional image codecs. In this paper, we propose a learned video compression system that mimics part of the pipeline of traditional codecs while leveraging learned image compression. It comprises two main modules: a learned intra-frame compression module, and a learned inter-frame compression module that is conditioned on intra-coded frames. These modules use separate learned probability models for entropy coding. The intra-frame codec uses a variant of nonlocal attention layers. Regarding the inter-frame codec, we propose an implicit motion information mechanism, and an enhancement of the inter-frame predictions by leveraging the high quality information of intra-coded frames. On the learned probability model side, we propose to use the reference frames as additional conditioning information. We used this system as our submitted entry for the 2021 Challenge on Learned Image Compression (CLIC). In our experiments, we show the effectiveness of our system and its components via a set of ablation studies.

Cite

Text

Zou et al. "Learned Video Compression with Intra-Guided Enhancement and Implicit Motion Information." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2021. doi:10.1109/CVPRW53098.2021.00208

Markdown

[Zou et al. "Learned Video Compression with Intra-Guided Enhancement and Implicit Motion Information." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2021.](https://mlanthology.org/cvprw/2021/zou2021cvprw-learned/) doi:10.1109/CVPRW53098.2021.00208

BibTeX

@inproceedings{zou2021cvprw-learned,
  title     = {{Learned Video Compression with Intra-Guided Enhancement and Implicit Motion Information}},
  author    = {Zou, Nannan and Zhang, Honglei and Cricri, Francesco and Tavakoli, Hamed R. and Lainema, Jani and Aksu, Emre and Hannuksela, Miska M. and Rahtu, Esa},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2021},
  pages     = {1870-1874},
  doi       = {10.1109/CVPRW53098.2021.00208},
  url       = {https://mlanthology.org/cvprw/2021/zou2021cvprw-learned/}
}