Neural Sign Actors: A Diffusion Model for 3D Sign Language Production from Text

Abstract

Sign Languages (SL) serve as the primary mode of communication for the Deaf and Hard of Hearing communities. Deep learning methods for SL recognition and translation have achieved promising results. However Sign Language Production (SLP) poses a challenge as the generated motions must be realistic and have precise semantic meaning. Most SLP methods rely on 2D data which hinders their realism. In this work a diffusion-based SLP model is trained on a curated large-scale dataset of 4D signing avatars and their corresponding text transcripts. The proposed method can generate dynamic sequences of 3D avatars from an unconstrained domain of discourse using a diffusion process formed on a novel and anatomically informed graph neural network defined on the SMPL-X body skeleton. Through quantitative and qualitative experiments we show that the proposed method considerably outperforms previous methods of SLP. This work makes an important step towards realistic neural sign avatars bridging the communication gap between Deaf and hearing communities.

Cite

Text

Baltatzis et al. "Neural Sign Actors: A Diffusion Model for 3D Sign Language Production from Text." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.00194

Markdown

[Baltatzis et al. "Neural Sign Actors: A Diffusion Model for 3D Sign Language Production from Text." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/baltatzis2024cvpr-neural/) doi:10.1109/CVPR52733.2024.00194

BibTeX

@inproceedings{baltatzis2024cvpr-neural,
  title     = {{Neural Sign Actors: A Diffusion Model for 3D Sign Language Production from Text}},
  author    = {Baltatzis, Vasileios and Potamias, Rolandos Alexandros and Ververas, Evangelos and Sun, Guanxiong and Deng, Jiankang and Zafeiriou, Stefanos},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {1985-1995},
  doi       = {10.1109/CVPR52733.2024.00194},
  url       = {https://mlanthology.org/cvpr/2024/baltatzis2024cvpr-neural/}
}