Auto-Encoded Supervision for Perceptual Image Super-Resolution

Abstract

This work tackles the fidelity objective in the perceptual super-resolution (SR) task. Specifically, we address the shortcomings of pixel-level \mathcal L _\text p loss (\mathcal L _\text pix ) in the GAN-based SR framework. Since \mathcal L _\text pix is known to have a trade-off relationship against perceptual quality, prior methods often multiply a small scale factor or utilize low-pass filters. However, this work shows that these circumventions fail to address the fundamental factor that induces blurring. Accordingly, we focus on two points: 1) precisely discriminating the subcomponent of \mathcal L _\text pix that contributes to blurring, and 2) guiding reconstruction only based on the factor that is free from this trade-off relationship. We show that this can be achieved in a surprisingly simple manner, with an Auto-Encoder (AE) pretrained using \mathcal L _\text pix . Based on this insight, we propose the Auto-Encoded Supervision for Optimal Penalization loss (\mathcal L _\text AESOP ), a novel loss function that measures distance in the AE space (the space after the decoder, not the bottleneck), rather than in the raw pixel space. By simply substituting \mathcal L _\text pix with \mathcal L _\text AESOP , we can provide effective reconstruction guidance without compromising perceptual quality. Designed for simplicity, our method enables easy integration into existing SR frameworks. Extensive experiments demonstrate the effectiveness of AESOP.

Cite

Text

Lee et al. "Auto-Encoded Supervision for Perceptual Image Super-Resolution." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.01673

Markdown

[Lee et al. "Auto-Encoded Supervision for Perceptual Image Super-Resolution." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/lee2025cvpr-autoencoded/) doi:10.1109/CVPR52734.2025.01673

BibTeX

@inproceedings{lee2025cvpr-autoencoded,
  title     = {{Auto-Encoded Supervision for Perceptual Image Super-Resolution}},
  author    = {Lee, MinKyu and Hyun, Sangeek and Jun, Woojin and Heo, Jae-Pil},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2025},
  pages     = {17958-17968},
  doi       = {10.1109/CVPR52734.2025.01673},
  url       = {https://mlanthology.org/cvpr/2025/lee2025cvpr-autoencoded/}
}