A Style-Based GAN Encoder for High Fidelity Reconstruction of Images and Videos
Abstract
We present a new encoder architecture for GAN inversion. The task is to reconstruct a real image from the latent space of a pre-trained Generative Adversarial Network (GAN). Unlike previous encoder-based methods which predict only a latent code from a real image, the proposed encoder maps the given image to both a latent code and a feature tensor, simultaneously. The feature tensor is key for accurate inversion, which helps to obtain better perceptual quality and lower reconstruction error. We conduct extensive experiments for several style-based generators pre-trained on different data domains. Our method is the first feed-forward encoder to include the feature tensor in the inversion, outperforming the state-of-the-art encoder-based methods for GAN inversion. Additionally, experiments on video inversion show that our method yields a more accurate and stable inversion for videos. This offers the possibility to perform real-time editing in videos.
Cite
Text
Yao et al. "A Style-Based GAN Encoder for High Fidelity Reconstruction of Images and Videos." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19784-0_34Markdown
[Yao et al. "A Style-Based GAN Encoder for High Fidelity Reconstruction of Images and Videos." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/yao2022eccv-stylebased/) doi:10.1007/978-3-031-19784-0_34BibTeX
@inproceedings{yao2022eccv-stylebased,
title = {{A Style-Based GAN Encoder for High Fidelity Reconstruction of Images and Videos}},
author = {Yao, Xu and Newson, Alasdair and Gousseau, Yann and Hellier, Pierre},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2022},
doi = {10.1007/978-3-031-19784-0_34},
url = {https://mlanthology.org/eccv/2022/yao2022eccv-stylebased/}
}