Text-Aware Image Restoration with Diffusion Models

Min, Jaewon; Kim, Jin Hyeon; Cho, Paul Hyunbin; Lee, Jaeeun; Park, Jihye; Kyu, Park Min; Kim, Sangpil; Park, Hyunhee; Kim, Seungryong

Text-Aware Image Restoration with Diffusion Models

Jaewon Min, Jin Hyeon Kim, Paul Hyunbin Cho, Jaeeun Lee, Jihye Park, Park Min Kyu, Sangpil Kim, Hyunhee Park, Seungryong Kim

ICLR 2026

/iclr/2026/min2026iclr-textaware/

Abstract

While diffusion models have achieved remarkable success in natural image restoration, they often fail to faithfully recover textual regions, frequently producing plausible yet incorrect text-like patterns, a phenomenon we term text-image hallucination. To address this limitation, we propose Text-Aware Image Restoration (TAIR), a task requiring simultaneous recovery of visual content and textual fidelity. For this purpose, we introduce SA-Text, a large-scale benchmark of 100K high-quality scene images with dense annotations of diverse and complex text instances. We further present a multi-task diffusion framework, TeReDiff, which leverages internal features of diffusion models to jointly train a text-spotting module with the restoration module. This design allows intermediate text predictions from the text-spotting module to condition the diffusion-based restoration process during denoising, thereby enhancing text recovery. Extensive experiments demonstrate that our approach faithfully restores textual regions, outperforms existing diffusion-based methods, and achieves new state-of-the-art results on TextZoom, an STISR benchmark considered a subtask of TAIR. The code, weights, and dataset will be publicly released.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Min et al. "Text-Aware Image Restoration with Diffusion Models." International Conference on Learning Representations, 2026.

Markdown

[Min et al. "Text-Aware Image Restoration with Diffusion Models." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/min2026iclr-textaware/)

BibTeX

@inproceedings{min2026iclr-textaware,
  title     = {{Text-Aware Image Restoration with Diffusion Models}},
  author    = {Min, Jaewon and Kim, Jin Hyeon and Cho, Paul Hyunbin and Lee, Jaeeun and Park, Jihye and Kyu, Park Min and Kim, Sangpil and Park, Hyunhee and Kim, Seungryong},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/min2026iclr-textaware/}
}