CLR-Face: Conditional Latent Refinement for Blind Face Restoration Using Score-Based Diffusion Models

Abstract

Compositional zero-shot learning (CZSL) is to recognize unseen attribute-object compositions by learning from seen compositions. The distribution shift between unseen compositions and seen compositions poses challenges to CZSL models, especially when test images are mixed with both seen and unseen compositions. The challenge will be addressed more easily if a model can distinguish unseen/seen compositions and treat them with specific recognition strategies. However, identifying images with unseen compositions is non-trivial, considering that unseen compositions are absent in training and usually contain only subtle differences from seen compositions. In this paper, we propose a novel compositional zero-shot learning method called COMO, which composes outliers in training for distinguishing seen and unseen compositions and further applying specific strategies for them. Specifically, we compose attribute-object representations for unseen compositions based on primitive representations of training images as outliers to enable the model to identify unseen compositions in inference. At test time, the method distinguishes images containing seen/unseen compositions and uses different weights for composition classification and primitive classification to recognize seen/unseen compositions. Experimental results on three datasets show the effectiveness of our method in both the closed-world setting and the open-world setting.

Cite

Text

Suin and Chellappa. "CLR-Face: Conditional Latent Refinement for Blind Face Restoration Using Score-Based Diffusion Models." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/143

Markdown

[Suin and Chellappa. "CLR-Face: Conditional Latent Refinement for Blind Face Restoration Using Score-Based Diffusion Models." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/suin2024ijcai-clr/) doi:10.24963/ijcai.2024/143

BibTeX

@inproceedings{suin2024ijcai-clr,
  title     = {{CLR-Face: Conditional Latent Refinement for Blind Face Restoration Using Score-Based Diffusion Models}},
  author    = {Suin, Maitreya and Chellappa, Rama},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {1290-1298},
  doi       = {10.24963/ijcai.2024/143},
  url       = {https://mlanthology.org/ijcai/2024/suin2024ijcai-clr/}
}