Hand Gesture Recognition Using Dual Graph Hierarchical Edges Representation and Graph Transformer Network

Abstract

Hand gestures can be studied by analyzing the location and movement of closely spaced and interacting joints, thus requiring tailored methods to accurately capture the joints intricate spatial and temporal dynamics. To address this particularity, we propose an approach for dynamic hand gesture recognition, called Graph Hierarchical Edge Representation-Graph Transformer (GHER-GT). Our approach utilizes a skeleton-based method with an end-to-end graph model integrated within an efficient spatio-temporal block. This spatio-temporal processing framework leverages multilevel spatial feature representation of hand joints, highlighting both directly connected and distant joints. Subsequently, an attention mechanism within the graph transformer model captures the temporal dynamics effectively. The higher-order information learning models essential interactions between hand joints selectively, enhancing computational efficiency and interpretability. An in-depth ablation study underscores the significance of our graph hierarchical approach and the impact of different architectural configurations. Experimental evaluations on three benchmark datasets, namely IPN, SHREC’17, and BRIAREO demonstrate the accuracy of our approach is competitive or even surpasses that reported in the state-of-the-art for the task of dynamic hand gesture recognition. The code is available at: https://github.com/Memmi36/GHER-GT .

Cite

Text

Memmi et al. "Hand Gesture Recognition Using Dual Graph Hierarchical Edges Representation and Graph Transformer Network." European Conference on Computer Vision Workshops, 2024. doi:10.1007/978-3-031-92591-7_4

Markdown

[Memmi et al. "Hand Gesture Recognition Using Dual Graph Hierarchical Edges Representation and Graph Transformer Network." European Conference on Computer Vision Workshops, 2024.](https://mlanthology.org/eccvw/2024/memmi2024eccvw-hand/) doi:10.1007/978-3-031-92591-7_4

BibTeX

@inproceedings{memmi2024eccvw-hand,
  title     = {{Hand Gesture Recognition Using Dual Graph Hierarchical Edges Representation and Graph Transformer Network}},
  author    = {Memmi, Mohamed Youssef and Slama, Rim and Berretti, Stefano},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2024},
  pages     = {53-68},
  doi       = {10.1007/978-3-031-92591-7_4},
  url       = {https://mlanthology.org/eccvw/2024/memmi2024eccvw-hand/}
}