Implicit Neural Video Compression
Abstract
We propose a method to compress full-resolution video sequences with implicit neural representations. Each frame is represented as a neural network that maps coordinate positions to pixel values. We use a separate implicit network to modulate the coordinate inputs, which enables efficient motion compensation between frames. Together with a small residual network, this allows us to efficiently compress P-frames relative to the previous frame. We further lower the bitrate by storing the network weights with learned integer quantization. Our method offers several simplifications over established neural video codecs: it does not require the receiver to have access to a pretrained neural network, does not use expensive interpolation-based warping operations, and does not require a separate training dataset.
Cite
Text
Zhang et al. "Implicit Neural Video Compression." ICLR 2022 Workshops: DGM4HSD, 2022.Markdown
[Zhang et al. "Implicit Neural Video Compression." ICLR 2022 Workshops: DGM4HSD, 2022.](https://mlanthology.org/iclrw/2022/zhang2022iclrw-implicit/)BibTeX
@inproceedings{zhang2022iclrw-implicit,
title = {{Implicit Neural Video Compression}},
author = {Zhang, Yunfan and van Rozendaal, Ties and Brehmer, Johann and Nagel, Markus and Cohen, Taco},
booktitle = {ICLR 2022 Workshops: DGM4HSD},
year = {2022},
url = {https://mlanthology.org/iclrw/2022/zhang2022iclrw-implicit/}
}