Implicit Neural Video Compression

Abstract

We propose a method to compress full-resolution video sequences with implicit neural representations. Each frame is represented as a neural network that maps coordinate positions to pixel values. We use a separate implicit network to modulate the coordinate inputs, which enables efficient motion compensation between frames. Together with a small residual network, this allows us to efficiently compress P-frames relative to the previous frame. We further lower the bitrate by storing the network weights with learned integer quantization. Our method offers several simplifications over established neural video codecs: it does not require the receiver to have access to a pretrained neural network, does not use expensive interpolation-based warping operations, and does not require a separate training dataset.

Cite

Text

Zhang et al. "Implicit Neural Video Compression." ICLR 2022 Workshops: DGM4HSD, 2022.

Markdown

[Zhang et al. "Implicit Neural Video Compression." ICLR 2022 Workshops: DGM4HSD, 2022.](https://mlanthology.org/iclrw/2022/zhang2022iclrw-implicit/)

BibTeX

@inproceedings{zhang2022iclrw-implicit,
  title     = {{Implicit Neural Video Compression}},
  author    = {Zhang, Yunfan and van Rozendaal, Ties and Brehmer, Johann and Nagel, Markus and Cohen, Taco},
  booktitle = {ICLR 2022 Workshops: DGM4HSD},
  year      = {2022},
  url       = {https://mlanthology.org/iclrw/2022/zhang2022iclrw-implicit/}
}