GlassesGAN: Eyewear Personalization Using Synthetic Appearance Discovery and Targeted Subspace Modeling
Abstract
We present GlassesGAN, a novel image editing framework for custom design of glasses, that sets a new standard in terms of output-image quality, edit realism, and continuous multi-style edit capability. To facilitate the editing process with GlassesGAN, we propose a Targeted Subspace Modelling (TSM) procedure that, based on a novel mechanism for (synthetic) appearance discovery in the latent space of a pre-trained GAN generator, constructs an eyeglasses-specific (latent) subspace that the editing framework can utilize. Additionally, we also introduce an appearance-constrained subspace initialization (SI) technique that centers the latent representation of the given input image in the well-defined part of the constructed subspace to improve the reliability of the learned edits. We test GlassesGAN on two (diverse) high-resolution datasets (CelebA-HQ and SiblingsDB-HQf) and compare it to three state-of-the-art baselines, i.e., InterfaceGAN, GANSpace, and MaskGAN. The reported results show that GlassesGAN convincingly outperforms all competing techniques, while offering functionality (e.g., fine-grained multi-style editing) not available with any of the competitors. The source code for GlassesGAN is made publicly available.
Cite
Text
Plesh et al. "GlassesGAN: Eyewear Personalization Using Synthetic Appearance Discovery and Targeted Subspace Modeling." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.01616Markdown
[Plesh et al. "GlassesGAN: Eyewear Personalization Using Synthetic Appearance Discovery and Targeted Subspace Modeling." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/plesh2023cvpr-glassesgan/) doi:10.1109/CVPR52729.2023.01616BibTeX
@inproceedings{plesh2023cvpr-glassesgan,
title = {{GlassesGAN: Eyewear Personalization Using Synthetic Appearance Discovery and Targeted Subspace Modeling}},
author = {Plesh, Richard and Peer, Peter and Struc, Vitomir},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2023},
pages = {16847-16857},
doi = {10.1109/CVPR52729.2023.01616},
url = {https://mlanthology.org/cvpr/2023/plesh2023cvpr-glassesgan/}
}