LD-ConGR: A Large RGB-D Video Dataset for Long-Distance Continuous Gesture Recognition

Abstract

Gesture recognition plays an important role in natural human-computer interaction and sign language recognition. Existing research on gesture recognition is limited to close-range interaction such as in-vehicle gesture control and face-to-face communication. To apply gesture recognition to long-distance interactive scenes such as meetings and smart homes, this paper establishes a large RGB-D video dataset, LD-ConGR. LD-ConGR is distinguished from existing gesture datasets by its long-distance gesture collection, fine-grained annotations, and high video quality. Specifically, 1) the farthest gesture in LD-ConGR is captured 4m away from the camera, while existing gesture datasets collect gestures within 1m of the camera; 2) besides the gesture category, the temporal segmentation of gestures and the hand location are also annotated in LD-ConGR; 3) videos are captured at high resolution (1280x720 for color streams and 640x576 for depth streams) and high frame rate (30 fps). On top of LD-ConGR, a series of experiments and studies are conducted, and the proposed gesture region estimation and key frame sampling strategies are demonstrated to be effective in dealing with long-distance gesture recognition and the uncertainty of gesture duration. The dataset and experimental results presented in this paper are expected to advance research on long-distance gesture recognition. The dataset is available at https://github.com/Diananini/LD-ConGR-CVPR2022.

Cite

Text

Liu et al. "LD-ConGR: A Large RGB-D Video Dataset for Long-Distance Continuous Gesture Recognition." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.00330

Markdown

[Liu et al. "LD-ConGR: A Large RGB-D Video Dataset for Long-Distance Continuous Gesture Recognition." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/liu2022cvpr-ldcongr/) doi:10.1109/CVPR52688.2022.00330

BibTeX

@inproceedings{liu2022cvpr-ldcongr,
  title     = {{LD-ConGR: A Large RGB-D Video Dataset for Long-Distance Continuous Gesture Recognition}},
  author    = {Liu, Dan and Zhang, Libo and Wu, Yanjun},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
  pages     = {3304-3312},
  doi       = {10.1109/CVPR52688.2022.00330},
  url       = {https://mlanthology.org/cvpr/2022/liu2022cvpr-ldcongr/}
}