Tri^2-Plane: Thinking Head Avatar via Feature Pyramid

Song, Luchuan; Liu, Pinxin; Chen, Lele; Yin, Guojun; Xu, Chenliang

doi:10.1007/978-3-031-72920-1_1

Tri^2-Plane: Thinking Head Avatar via Feature Pyramid

Luchuan Song, Pinxin Liu, Lele Chen, Guojun Yin, Chenliang Xu

ECCV 2024

doi:10.1007/978-3-031-72920-1_1 /eccv/2024/song2024eccv-tri/

Abstract

Recent years have witnessed considerable achievements in facial avatar reconstruction with neural volume rendering. Despite notable advancements, the reconstruction of complex and dynamic head movements from monocular videos still suffers from capturing and restoring fine-grained details. In this work, we propose a novel approach, named Tri2 -plane, for monocular photo-realistic volumetric head avatar reconstructions. Distinct from the existing works that rely on a single tri-plane deformation field for dynamic facial modeling, the proposed Tri2 -plane leverages the principle of feature pyramids and three top-to-down lateral connections tri-planes for details improvement. It samples and renders facial details at multiple scales, transitioning from the entire face to specific local regions and then to even more refined sub-regions. Moreover, we incorporate a camera-based geometry-aware sliding window method as an augmentation in training, which improves the robustness beyond the canonical space, with a particular improvement in cross-identity generation capabilities. Experimental outcomes indicate that the Tri2 -plane not only surpasses existing methodologies but also achieves superior performance across quantitative and qualitative assessments. The project website is: https://songluchuan.github.io/Tri2Plane.github. io/.

PDF ECCV Semantic Scholar

Cite

Text

Song et al. "Tri^2-Plane: Thinking Head Avatar via Feature Pyramid." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72920-1_1

Markdown

[Song et al. "Tri^2-Plane: Thinking Head Avatar via Feature Pyramid." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/song2024eccv-tri/) doi:10.1007/978-3-031-72920-1_1

BibTeX

@inproceedings{song2024eccv-tri,
  title     = {{Tri^2-Plane: Thinking Head Avatar via Feature Pyramid}},
  author    = {Song, Luchuan and Liu, Pinxin and Chen, Lele and Yin, Guojun and Xu, Chenliang},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-72920-1_1},
  url       = {https://mlanthology.org/eccv/2024/song2024eccv-tri/}
}