Slimmable Video Codec

Abstract

Neural video compression has emerged as a novel paradigm combining trainable multilayer neural net-works and machine learning, achieving competitive rate-distortion (RD) performances, but still remaining impractical due to heavy neural architectures, with large memory and computational demands. In addition, models are usually optimized for a single RD tradeoff. Recent slimmable image codecs can dynamically adjust their model capacity to gracefully reduce the memory and computation requirements, without harming RD performance. In this paper we propose a slimmable video codec (SlimVC), by integrating a slimmable temporal entropy model in a slimmable autoencoder. Despite a significantly more complex architecture, we show that slimming remains a powerful mechanism to control rate, memory footprint, computational cost and latency, all being important requirements for practical video compression.

Cite

Text

Liu et al. "Slimmable Video Codec." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022. doi:10.1109/CVPRW56347.2022.00183

Markdown

[Liu et al. "Slimmable Video Codec." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022.](https://mlanthology.org/cvprw/2022/liu2022cvprw-slimmable/) doi:10.1109/CVPRW56347.2022.00183

BibTeX

@inproceedings{liu2022cvprw-slimmable,
  title     = {{Slimmable Video Codec}},
  author    = {Liu, Zhaocheng and Herranz, Luis and Yang, Fei and Zhang, Saiping and Wan, Shuai and Mrak, Marta and Blanch, Marc Górriz},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2022},
  pages     = {1742-1746},
  doi       = {10.1109/CVPRW56347.2022.00183},
  url       = {https://mlanthology.org/cvprw/2022/liu2022cvprw-slimmable/}
}