Pixel-Level and Semantic-Level Adjustable Super-Resolution: A Dual-LoRA Approach

Sun, Lingchen; Wu, Rongyuan; Ma, Zhiyuan; Liu, Shuaizheng; Yi, Qiaosi; Zhang, Lei

doi:10.1109/CVPR52734.2025.00223

Pixel-Level and Semantic-Level Adjustable Super-Resolution: A Dual-LoRA Approach

Lingchen Sun, Rongyuan Wu, Zhiyuan Ma, Shuaizheng Liu, Qiaosi Yi, Lei Zhang

CVPR 2025 pp. 2333-2343

doi:10.1109/CVPR52734.2025.00223 /cvpr/2025/sun2025cvpr-pixellevel/

Abstract

Diffusion prior-based methods have shown impressive results in real-world image super-resolution (SR). However, most existing methods entangle pixel-level and semantic-level SR objectives in the training process, struggling to balance pixel-wise fidelity and perceptual quality. Meanwhile, users have varying preferences on SR results, thus it is demanded to develop an adjustable SR model that can be tailored to different fidelity-perception preferences during inference without re-training. We present Pixel-level and Semantic-level Adjustable SR (PiSA-SR), which learns two LoRA modules upon the pre-trained stable-diffusion (SD) model to achieve improved and adjustable SR results. We first formulate the SD-based SR problem as learning the residual between the low-quality input and the high-quality output, then show that the learning objective can be decoupled into two distinct LoRA weight spaces: one is characterized by the l2-loss for pixel-level regression, and another is characterized by the LPIPS and classifier score distillation losses to extract semantic information from pre-trained classification and SD models. In its default setting, PiSA-SR can be performed in a single diffusion step, achieving leading real-world SR results in both quality and efficiency. By introducing two adjustable guidance scales on the two LoRA modules to control the strengths of pixel-wise fidelity and semantic-level details during inference, PiSA-SR can offer flexible SR results according to user preference without re-training. The source code of our method can be found at https://github.com/csslc/PiSA-SR.

PDF CVPR Semantic Scholar

Cite

Text

Sun et al. "Pixel-Level and Semantic-Level Adjustable Super-Resolution: A Dual-LoRA Approach." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.00223

Markdown

[Sun et al. "Pixel-Level and Semantic-Level Adjustable Super-Resolution: A Dual-LoRA Approach." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/sun2025cvpr-pixellevel/) doi:10.1109/CVPR52734.2025.00223

BibTeX

@inproceedings{sun2025cvpr-pixellevel,
  title     = {{Pixel-Level and Semantic-Level Adjustable Super-Resolution: A Dual-LoRA Approach}},
  author    = {Sun, Lingchen and Wu, Rongyuan and Ma, Zhiyuan and Liu, Shuaizheng and Yi, Qiaosi and Zhang, Lei},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2025},
  pages     = {2333-2343},
  doi       = {10.1109/CVPR52734.2025.00223},
  url       = {https://mlanthology.org/cvpr/2025/sun2025cvpr-pixellevel/}
}