Multi-Modality Gesture Detection and Recognition with Un-Supervision, Randomization and Discrimination
Abstract
We describe in this paper our gesture detection and recognition system for the 2014 ChaLearn Looking at People (Track 3: Gesture Recognition) organized by ChaLearn in conjunction with the ECCV 2014 conference. The competition’s task was to learn a vacabulary of 20 types of Italian gestures and detect them in sequences. Our system adopts a multi-modality approach for detecting as well as recognizing the gestures. The goal of our approach is to identify semantically meaningful contents from dense sampling spatio-temporal feature space for gesture recognition. To achieve this, we develop three concepts under the random forest framework: un-supervision; discrimination; and randomization. Un-supervision learns spatio-temporal features from two channels (grayscale and depth) of RGB-D video in an unsupervised way. Discrimination extracts the information in dense sampling spatio-temporal space effectively. Randomization explores the dense sampling spatio-temporal feature space efficiently. An evaluation of our approach shows that we achieve a mean Jaccard Index of $0.6489$ , and a mean average accuracy of $90.3\,\%$ over the test dataset.
Cite
Text
Chen et al. "Multi-Modality Gesture Detection and Recognition with Un-Supervision, Randomization and Discrimination." European Conference on Computer Vision Workshops, 2014. doi:10.1007/978-3-319-16178-5_43Markdown
[Chen et al. "Multi-Modality Gesture Detection and Recognition with Un-Supervision, Randomization and Discrimination." European Conference on Computer Vision Workshops, 2014.](https://mlanthology.org/eccvw/2014/chen2014eccvw-multimodality/) doi:10.1007/978-3-319-16178-5_43BibTeX
@inproceedings{chen2014eccvw-multimodality,
title = {{Multi-Modality Gesture Detection and Recognition with Un-Supervision, Randomization and Discrimination}},
author = {Chen, Guang and Clarke, Daniel and Giuliani, Manuel and Gaschler, Andre and Wu, Di and Weikersdorfer, David and Knoll, Alois C.},
booktitle = {European Conference on Computer Vision Workshops},
year = {2014},
pages = {608-622},
doi = {10.1007/978-3-319-16178-5_43},
url = {https://mlanthology.org/eccvw/2014/chen2014eccvw-multimodality/}
}