Real-Time 3D-Aware Portrait Editing from a Single Image

Abstract

This work presents , a practical method that can efficiently edit a face image following given prompts, like reference images or text descriptions, in a 3D-aware manner. To this end, a lightweight module is distilled from a 3D portrait generator and a text-to-image model, which provide prior knowledge of face geometry and superior editing capability, respectively. Such a design brings two compelling advantages over existing approaches. First, our method achieves real-time editing with a feedforward network (i.e., ∼0.04s per image), over 100× faster than the second competitor. Second, thanks to the powerful priors, our module could focus on the learning of editing-related variations, such that it manages to handle various types of editing simultaneously in the training phase and further supports fast adaptation to user-specified customized types of editing during inference (e.g., with ∼5min fine-tuning per style). Project page can be found here.

Cite

Text

Bai et al. "Real-Time 3D-Aware Portrait Editing from a Single Image." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72983-6_20

Markdown

[Bai et al. "Real-Time 3D-Aware Portrait Editing from a Single Image." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/bai2024eccv-realtime/) doi:10.1007/978-3-031-72983-6_20

BibTeX

@inproceedings{bai2024eccv-realtime,
  title     = {{Real-Time 3D-Aware Portrait Editing from a Single Image}},
  author    = {Bai, Qingyan and Shi, Zifan and Xu, Yinghao and Ouyang, Hao and Wang, Qiuyu and Yang, Ceyuan and Wang, Xuan and Wetzstein, Gordon and Shen, Yujun and Chen, Qifeng},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-72983-6_20},
  url       = {https://mlanthology.org/eccv/2024/bai2024eccv-realtime/}
}