Real-Time 6k Image Rescaling with Rate-Distortion Optimization
Abstract
The task of image rescaling aims at embedding an high-resolution (HR) image into a low-resolution (LR) one that can contain embedded information for HR image reconstruction. Existing image rescaling methods do not optimize the LR image file size and recent flow-based rescaling methods are not real-time yet for HR image reconstruction (e.g., 6K). To address these two challenges, we propose a novel framework (HyperThumbnail) for real-time 6K rate-distortion-aware image rescaling. Our HyperThumbnail first embeds an HR image into a JPEG LR image (thumbnail) by an encoder with our proposed learnable JPEG quantization module, which optimizes the file size of the embedding LR JPEG image. Then, an efficient decoder reconstructs a high-fidelity HR (6K) image from the LR one in real time. Extensive experiments demonstrate that our framework outperforms previous image rescaling baselines in both rate-distortion performance and is much faster than prior work in HR image reconstruction speed.
Cite
Text
Qi et al. "Real-Time 6k Image Rescaling with Rate-Distortion Optimization." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.01354Markdown
[Qi et al. "Real-Time 6k Image Rescaling with Rate-Distortion Optimization." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/qi2023cvpr-realtime/) doi:10.1109/CVPR52729.2023.01354BibTeX
@inproceedings{qi2023cvpr-realtime,
title = {{Real-Time 6k Image Rescaling with Rate-Distortion Optimization}},
author = {Qi, Chenyang and Yang, Xin and Cheng, Ka Leong and Chen, Ying-Cong and Chen, Qifeng},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2023},
pages = {14092-14101},
doi = {10.1109/CVPR52729.2023.01354},
url = {https://mlanthology.org/cvpr/2023/qi2023cvpr-realtime/}
}