Style Transfer for 2D Talking Head Generation

Abstract

Audio-driven talking head animation is a challenging research topic with many real-world applications. Recent works have focused on creating photo-realistic 2D animation, while learning different talking or singing styles remains an open problem. In this paper, we present a new method to generate talking head animation with learnable style references. Given a set of style reference frames, our framework can reconstruct 2D talking head animation based on a single input image and an audio stream. Our method first produces facial landmarks motion from the audio stream and constructs the intermediate style patterns from the style reference images. We then feed both outputs into a style-aware image generator to generate the photo-realistic and fidelity 2D animation. In practice, our framework can extract the style information of a specific character and transfer it to any new static image for talking head animation. The intensive experimental results show that our method achieves better results than recent state-of-the-art approaches qualitatively and quantitatively. Our source code will be made publicly available.

Cite

Text

Pham et al. "Style Transfer for 2D Talking Head Generation." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024. doi:10.1109/CVPRW63382.2024.00745

Markdown

[Pham et al. "Style Transfer for 2D Talking Head Generation." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024.](https://mlanthology.org/cvprw/2024/pham2024cvprw-style/) doi:10.1109/CVPRW63382.2024.00745

BibTeX

@inproceedings{pham2024cvprw-style,
  title     = {{Style Transfer for 2D Talking Head Generation}},
  author    = {Pham, Trong-Thang and Do, Tuong and Le, Nhat and Le, Ngan and Nguyen, Hung and Tjiputra, Erman and Tran, Quang and Nguyen, Anh},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2024},
  pages     = {7500-7509},
  doi       = {10.1109/CVPRW63382.2024.00745},
  url       = {https://mlanthology.org/cvprw/2024/pham2024cvprw-style/}
}