PILC: Practical Image Lossless Compression with an End-to-End GPU Oriented Neural Framework

Abstract

Generative model based image lossless compression algorithms have seen a great success in improving compression ratio. However, the throughput for most of them is less than 1 MB/s even with the most advanced AI accelerated chips, preventing them from most real-world applications, which often require 100 MB/s. In this paper, we propose PILC, an end-to-end image lossless compression framework that achieves 200 MB/s for both compression and decompression with a single NVIDIA Tesla V100 GPU, 10x faster than the most efficient one before. To obtain this result, we first develop an AI codec that combines auto-regressive model and VQ-VAE which performs well in lightweight setting, then we design a low complexity entropy coder that works well with our codec. Experiments show that our framework compresses better than PNG by a margin of 30% in multiple datasets. We believe this is an important step to bring AI compression forward to commercial use.

Cite

Text

Kang et al. "PILC: Practical Image Lossless Compression with an End-to-End GPU Oriented Neural Framework." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.00372

Markdown

[Kang et al. "PILC: Practical Image Lossless Compression with an End-to-End GPU Oriented Neural Framework." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/kang2022cvpr-pilc/) doi:10.1109/CVPR52688.2022.00372

BibTeX

@inproceedings{kang2022cvpr-pilc,
  title     = {{PILC: Practical Image Lossless Compression with an End-to-End GPU Oriented Neural Framework}},
  author    = {Kang, Ning and Qiu, Shanzhao and Zhang, Shifeng and Li, Zhenguo and Xia, Shu-Tao},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
  pages     = {3739-3748},
  doi       = {10.1109/CVPR52688.2022.00372},
  url       = {https://mlanthology.org/cvpr/2022/kang2022cvpr-pilc/}
}