Slimmable Video Codec
Abstract
Neural video compression has emerged as a novel paradigm combining trainable multilayer neural net-works and machine learning, achieving competitive rate-distortion (RD) performances, but still remaining impractical due to heavy neural architectures, with large memory and computational demands. In addition, models are usually optimized for a single RD tradeoff. Recent slimmable image codecs can dynamically adjust their model capacity to gracefully reduce the memory and computation requirements, without harming RD performance. In this paper we propose a slimmable video codec (SlimVC), by integrating a slimmable temporal entropy model in a slimmable autoencoder. Despite a significantly more complex architecture, we show that slimming remains a powerful mechanism to control rate, memory footprint, computational cost and latency, all being important requirements for practical video compression.
Cite
Text
Liu et al. "Slimmable Video Codec." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022. doi:10.1109/CVPRW56347.2022.00183Markdown
[Liu et al. "Slimmable Video Codec." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022.](https://mlanthology.org/cvprw/2022/liu2022cvprw-slimmable/) doi:10.1109/CVPRW56347.2022.00183BibTeX
@inproceedings{liu2022cvprw-slimmable,
title = {{Slimmable Video Codec}},
author = {Liu, Zhaocheng and Herranz, Luis and Yang, Fei and Zhang, Saiping and Wan, Shuai and Mrak, Marta and Blanch, Marc Górriz},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2022},
pages = {1742-1746},
doi = {10.1109/CVPRW56347.2022.00183},
url = {https://mlanthology.org/cvprw/2022/liu2022cvprw-slimmable/}
}