DAD-3DHeads: A Large-Scale Dense, Accurate and Diverse Dataset for 3D Head Alignment from a Single Image

Abstract

We present DAD-3DHeads, a dense and diverse large-scale dataset, and a robust model for 3D Dense Head Alignment in-the-wild. It contains annotations of over 3.5K landmarks that accurately represent 3D head shape compared to the ground-truth scans. The data-driven model, DAD-3DNet, trained on our dataset, learns shape, expression, and pose parameters, and performs 3D reconstruction of a FLAME mesh. The model also incorporates a landmark prediction branch to take advantage of rich supervision and co-training of multiple related tasks. Experimentally, DAD- 3DNet outperforms or is comparable to the state-of-the-art models in (i) 3D Head Pose Estimation on AFLW2000-3D and BIWI, (ii) 3D Face Shape Reconstruction on NoW and Feng, and (iii) 3D Dense Head Alignment and 3D Landmarks Estimation on DAD-3DHeads dataset. Finally, diversity of DAD-3DHeads in camera angles, facial expressions, and occlusions enables a benchmark to study in-the-wild generalization and robustness to distribution shifts. The dataset webpage is https://p.farm/research/dad-3dheads.

Cite

Text

Martyniuk et al. "DAD-3DHeads: A Large-Scale Dense, Accurate and Diverse Dataset for 3D Head Alignment from a Single Image." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.02027

Markdown

[Martyniuk et al. "DAD-3DHeads: A Large-Scale Dense, Accurate and Diverse Dataset for 3D Head Alignment from a Single Image." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/martyniuk2022cvpr-dad3dheads/) doi:10.1109/CVPR52688.2022.02027

BibTeX

@inproceedings{martyniuk2022cvpr-dad3dheads,
  title     = {{DAD-3DHeads: A Large-Scale Dense, Accurate and Diverse Dataset for 3D Head Alignment from a Single Image}},
  author    = {Martyniuk, Tetiana and Kupyn, Orest and Kurlyak, Yana and Krashenyi, Igor and Matas, Jiří and Sharmanska, Viktoriia},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
  pages     = {20942-20952},
  doi       = {10.1109/CVPR52688.2022.02027},
  url       = {https://mlanthology.org/cvpr/2022/martyniuk2022cvpr-dad3dheads/}
}