The Devil Is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing

Abstract

The task of manipulating real image attributes through StyleGAN inversion has been extensively researched. This process involves searching latent variables from a well-trained StyleGAN generator that can synthesize a real image modifying these latent variables and then synthesizing an image with the desired edits. A balance must be struck between the quality of the reconstruction and the ability to edit. Earlier studies utilized the low-dimensional W-space for latent search which facilitated effective editing but struggled with reconstructing intricate details. More recent research has turned to the high-dimensional feature space F which successfully inverses the input image but loses much of the detail during editing. In this paper we introduce StyleFeatureEditor -- a novel method that enables editing in both w-latents and F-latents. This technique not only allows for the reconstruction of finer image details but also ensures their preservation during editing. We also present a new training pipeline specifically designed to train our model to accurately edit F-latents. Our method is compared with state-of-the-art encoding approaches demonstrating that our model excels in terms of reconstruction quality and is capable of editing even challenging out-of-domain examples.

Cite

Text

Bobkov et al. "The Devil Is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.00892

Markdown

[Bobkov et al. "The Devil Is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/bobkov2024cvpr-devil/) doi:10.1109/CVPR52733.2024.00892

BibTeX

@inproceedings{bobkov2024cvpr-devil,
  title     = {{The Devil Is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing}},
  author    = {Bobkov, Denis and Titov, Vadim and Alanov, Aibek and Vetrov, Dmitry},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {9337-9346},
  doi       = {10.1109/CVPR52733.2024.00892},
  url       = {https://mlanthology.org/cvpr/2024/bobkov2024cvpr-devil/}
}