UV-IDM: Identity-Conditioned Latent Diffusion Model for Face UV-Texture Generation

Li, Hong; Feng, Yutang; Xue, Song; Liu, Xuhui; Zeng, Bohan; Li, Shanglin; Liu, Boyu; Liu, Jianzhuang; Han, Shumin; Zhang, Baochang

doi:10.1109/CVPR52733.2024.01007

UV-IDM: Identity-Conditioned Latent Diffusion Model for Face UV-Texture Generation

Hong Li, Yutang Feng, Song Xue, Xuhui Liu, Bohan Zeng, Shanglin Li, Boyu Liu, Jianzhuang Liu, Shumin Han, Baochang Zhang

CVPR 2024 pp. 10585-10595

doi:10.1109/CVPR52733.2024.01007 /cvpr/2024/li2024cvpr-uvidm/

Abstract

3D face reconstruction aims at generating high-fidelity 3D face shapes and textures from single-view or multi-view images. However current prevailing facial texture generation methods generally suffer from low-quality texture identity information loss and inadequate handling of occlusions. To solve these problems we introduce an Identity-Conditioned Latent Diffusion Model for face UV-texture generation (UV-IDM) to generate photo-realistic textures based on the Basel Face Model (BFM). UV-IDM leverages the powerful texture generation capacity of a latent diffusion model (LDM) to obtain detailed facial textures. To preserve the identity during the reconstruction procedure we design an identity-conditioned module that can utilize any in-the-wild image as a robust condition for the LDM to guide texture generation. UV-IDM can be easily adapted to different BFM-based methods as a high-fidelity texture generator. Furthermore in light of the limited accessibility of most existing UV-texture datasets we build a large-scale and publicly available UV-texture dataset based on BFM termed BFM-UV. Extensive experiments show that our UV-IDM can generate high-fidelity textures in 3D face reconstruction within seconds while maintaining image consistency bringing new state-of-the-art performance in facial texture generation.

PDF CVPR Semantic Scholar

Cite

Text

Li et al. "UV-IDM: Identity-Conditioned Latent Diffusion Model for Face UV-Texture Generation." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.01007

Markdown

[Li et al. "UV-IDM: Identity-Conditioned Latent Diffusion Model for Face UV-Texture Generation." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/li2024cvpr-uvidm/) doi:10.1109/CVPR52733.2024.01007

BibTeX

@inproceedings{li2024cvpr-uvidm,
  title     = {{UV-IDM: Identity-Conditioned Latent Diffusion Model for Face UV-Texture Generation}},
  author    = {Li, Hong and Feng, Yutang and Xue, Song and Liu, Xuhui and Zeng, Bohan and Li, Shanglin and Liu, Boyu and Liu, Jianzhuang and Han, Shumin and Zhang, Baochang},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {10585-10595},
  doi       = {10.1109/CVPR52733.2024.01007},
  url       = {https://mlanthology.org/cvpr/2024/li2024cvpr-uvidm/}
}