GLIC: General Format Learned Image Compression
Abstract
Learned image lossy compression techniques have surpassed traditional methods in both subjective vision and quantitative evaluation. However, current models are only applicable to three-channel image formats, limiting their practical application due to the diversity and complexity of image formats. We propose a high-performance learned image compression model for general image formats. We first introduce a transfer method to unify any-channel image formats, enhancing the applicability of neural networks. This method's effectiveness is demonstrated through image information entropy and image homomorphism theory. Then, we introduce an adaptive attention residual block into the entropy model to give it better generalization ability. Meanwhile, we propose an evenly grouped cross-channel context module for progressive preview image decoding. Experimental results demonstrate that our method achieves state-of-the-art (SOTA) in the field of learned image compression in terms of PSNR and MS-SSIM. This work extends the applicability of learned image compression techniques to more practical production environments.
Cite
Text
Zhou and Kong. "GLIC: General Format Learned Image Compression." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I10.33175Markdown
[Zhou and Kong. "GLIC: General Format Learned Image Compression." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/zhou2025aaai-glic/) doi:10.1609/AAAI.V39I10.33175BibTeX
@inproceedings{zhou2025aaai-glic,
title = {{GLIC: General Format Learned Image Compression}},
author = {Zhou, Mingsheng and Kong, Mingming},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {10815-10824},
doi = {10.1609/AAAI.V39I10.33175},
url = {https://mlanthology.org/aaai/2025/zhou2025aaai-glic/}
}