Arc2Face: A Foundation Model for ID-Consistent Human Faces
Abstract
This paper presents , an identity-conditioned face foundation model, which, given the ArcFace embedding of a person, can generate diverse photo-realistic images with an unparalleled degree of face similarity than existing models. Despite previous attempts to decode face recognition features into detailed images, we find that common high-resolution datasets (FFHQ) lack sufficient identities to reconstruct any subject. To that end, we meticulously upsample a significant portion of the WebFace42M database, the largest public dataset for face recognition (FR). builds upon a pretrained Stable Diffusion model, yet adapts it to the task of ID-to-face generation, conditioned solely on ID vectors. Deviating from recent works that combine ID with text embeddings for zero-shot personalization of text-to-image models, we emphasize on the compactness of FR features, which can fully capture the essence of the human face, as opposed to hand-crafted prompts. Crucially, text-augmented models struggle to decouple identity and text, usually necessitating some description of the given face to achieve satisfactory similarity. , however, only needs the discriminative features of ArcFace to guide the generation, offering a robust prior for a plethora of tasks where ID consistency is of paramount importance. As an example, we train a FR model on synthetic images from our model and achieve superior performance to existing synthetic datasets.
Cite
Text
Papantoniou et al. "Arc2Face: A Foundation Model for ID-Consistent Human Faces." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72913-3_14Markdown
[Papantoniou et al. "Arc2Face: A Foundation Model for ID-Consistent Human Faces." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/papantoniou2024eccv-arc2face/) doi:10.1007/978-3-031-72913-3_14BibTeX
@inproceedings{papantoniou2024eccv-arc2face,
title = {{Arc2Face: A Foundation Model for ID-Consistent Human Faces}},
author = {Papantoniou, Foivos Paraperas and Lattas, Alexandros and Moschoglou, Stylianos and Deng, Jiankang and Kainz, Bernhard and Zafeiriou, Stefanos},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-72913-3_14},
url = {https://mlanthology.org/eccv/2024/papantoniou2024eccv-arc2face/}
}