Degradation-Modeled Multipath Diffusion for Tunable Metalens Photography

Abstract

Metalenses offer significant potential for ultra-compact computational imaging but face challenges from complex optical degradation and computational restoration difficulties. Existing methods typically rely on precise optical calibration or massive paired datasets, which are non-trivial for real-world imaging systems. Furthermore, lack of control over the inference process often results in undesirable hallucinated artifacts. We introduce Degradation-Modeled Multipath Diffusion for tunable metalens photography, leveraging powerful natural image priors from pretrained models instead of large datasets. Our framework uses positive, neutral, and negative-prompt paths to balance high-frequency detail generation, structural fidelity, and suppression of metalens-specific degradation, alongside pseudo data augmentation. A tunable decoder enables controlled trade-offs between fidelity and perceptual quality. Additionally, a spatially varying degradation-aware attention (SVDA) module adaptively models complex optical and sensor-induced degradation. Finally, we design and build a millimeter-scale MetaCamera for real-world validation. Extensive results show that our approach outperforms state-of-the-art methods, achieving high-fidelity and sharp image reconstruction. More materials: https://dmdiff.github.io/.

Cite

Text

Zhang et al. "Degradation-Modeled Multipath Diffusion for Tunable Metalens Photography." International Conference on Computer Vision, 2025.

Markdown

[Zhang et al. "Degradation-Modeled Multipath Diffusion for Tunable Metalens Photography." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/zhang2025iccv-degradationmodeled/)

BibTeX

@inproceedings{zhang2025iccv-degradationmodeled,
  title     = {{Degradation-Modeled Multipath Diffusion for Tunable Metalens Photography}},
  author    = {Zhang, Jianing and Zhu, Jiayi and Ji, Feiyu and Yang, Xiaokang and Yuan, Xiaoyun},
  booktitle = {International Conference on Computer Vision},
  year      = {2025},
  pages     = {25914-25924},
  url       = {https://mlanthology.org/iccv/2025/zhang2025iccv-degradationmodeled/}
}