Real-Time 6k Image Rescaling with Rate-Distortion Optimization

Abstract

The task of image rescaling aims at embedding an high-resolution (HR) image into a low-resolution (LR) one that can contain embedded information for HR image reconstruction. Existing image rescaling methods do not optimize the LR image file size and recent flow-based rescaling methods are not real-time yet for HR image reconstruction (e.g., 6K). To address these two challenges, we propose a novel framework (HyperThumbnail) for real-time 6K rate-distortion-aware image rescaling. Our HyperThumbnail first embeds an HR image into a JPEG LR image (thumbnail) by an encoder with our proposed learnable JPEG quantization module, which optimizes the file size of the embedding LR JPEG image. Then, an efficient decoder reconstructs a high-fidelity HR (6K) image from the LR one in real time. Extensive experiments demonstrate that our framework outperforms previous image rescaling baselines in both rate-distortion performance and is much faster than prior work in HR image reconstruction speed.

Cite

Text

Qi et al. "Real-Time 6k Image Rescaling with Rate-Distortion Optimization." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.01354

Markdown

[Qi et al. "Real-Time 6k Image Rescaling with Rate-Distortion Optimization." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/qi2023cvpr-realtime/) doi:10.1109/CVPR52729.2023.01354

BibTeX

@inproceedings{qi2023cvpr-realtime,
  title     = {{Real-Time 6k Image Rescaling with Rate-Distortion Optimization}},
  author    = {Qi, Chenyang and Yang, Xin and Cheng, Ka Leong and Chen, Ying-Cong and Chen, Qifeng},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2023},
  pages     = {14092-14101},
  doi       = {10.1109/CVPR52729.2023.01354},
  url       = {https://mlanthology.org/cvpr/2023/qi2023cvpr-realtime/}
}