The NGT200 Dataset - Geometric Multi-View Isolated Sign Recognition
Abstract
Sign Language Processing (SLP) provides the foundation for a more inclusive future in language technology; however, the field must first overcome significant challenges. This work addresses multi-view isolated sign recognition (MV-ISR), emphasizing the critical importance of 3D awareness for real-world SLP applications. We introduce a new benchmark for MV-ISR, the NGT200 dataset, and define MV-ISR as a distinct task from single-view ISR. We showcase the benefits of including synthetic data, and propose to condition sign representations on the spatial symmetries inherent to the visual modality of sign language. We enhance MV-ISR performance by 8%-22% using a geometrically grounded model compared to the SL-GCN baseline.
Cite
Text
Ranum et al. "The NGT200 Dataset - Geometric Multi-View Isolated Sign Recognition." ICML 2024 Workshops: GRaM, 2024.Markdown
[Ranum et al. "The NGT200 Dataset - Geometric Multi-View Isolated Sign Recognition." ICML 2024 Workshops: GRaM, 2024.](https://mlanthology.org/icmlw/2024/ranum2024icmlw-ngt200/)BibTeX
@inproceedings{ranum2024icmlw-ngt200,
title = {{The NGT200 Dataset - Geometric Multi-View Isolated Sign Recognition}},
author = {Ranum, Oline and Wessels, David and Otterspeer, Gomèr and Bekkers, Erik J and Roelofsen, Floris and Andersen, Jari I.},
booktitle = {ICML 2024 Workshops: GRaM},
year = {2024},
url = {https://mlanthology.org/icmlw/2024/ranum2024icmlw-ngt200/}
}