FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles

Cite

Text

Zhang et al. "FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I24.34786

Markdown

[Zhang et al. "FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/zhang2025aaai-facespeak/) doi:10.1609/AAAI.V39I24.34786

BibTeX

@inproceedings{zhang2025aaai-facespeak,
  title     = {{FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles}},
  author    = {Zhang, Tian-Hao and Zhang, Jiawei and Wang, Jun and Qian, Xinyuan and Yin, Xu-Cheng},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {25922-25930},
  doi       = {10.1609/AAAI.V39I24.34786},
  url       = {https://mlanthology.org/aaai/2025/zhang2025aaai-facespeak/}
}