Idempotence and Perceptual Image Compression
Abstract
Idempotence is the stability of image codec to re-compression. At the first glance, it is unrelated to perceptual image compression. However, we find that theoretically: 1) Conditional generative model-based perceptual codec satisfies idempotence; 2) Unconditional generative model with idempotence constraint is equivalent to conditional generative codec. Based on this newfound equivalence, we propose a new paradigm of perceptual image codec by inverting unconditional generative model with idempotence constraints. Our codec is theoretically equivalent to conditional generative codec, and it does not require training new models. Instead, it only requires a pre-trained mean-square-error codec and unconditional generative model. Empirically, we show that our proposed approach outperforms state-of-the-art methods such as HiFiC and ILLM, in terms of Fréchet Inception Distance (FID). The source code is provided in https://github.com/tongdaxu/Idempotence-and-Perceptual-Image-Compression.
Cite
Text
Xu et al. "Idempotence and Perceptual Image Compression." International Conference on Learning Representations, 2024.Markdown
[Xu et al. "Idempotence and Perceptual Image Compression." International Conference on Learning Representations, 2024.](https://mlanthology.org/iclr/2024/xu2024iclr-idempotence/)BibTeX
@inproceedings{xu2024iclr-idempotence,
title = {{Idempotence and Perceptual Image Compression}},
author = {Xu, Tongda and Zhu, Ziran and He, Dailan and Li, Yanghao and Guo, Lina and Wang, Yuanyuan and Wang, Zhe and Qin, Hongwei and Wang, Yan and Liu, Jingjing and Zhang, Ya-Qin},
booktitle = {International Conference on Learning Representations},
year = {2024},
url = {https://mlanthology.org/iclr/2024/xu2024iclr-idempotence/}
}