Geometric Deep Neural Network Using Rigid and Non-Rigid Transformations for Human Action Recognition
Abstract
Deep Learning architectures, albeit successful in mostcomputer vision tasks, were designed for data with an un-derlying Euclidean structure, which is not usually fulfilledsince pre-processed data may lie on a non-linear space.In this paper, we propose a geometry aware deep learn-ing approach using rigid and non rigid transformation opti-mization for skeleton-based action recognition. Skeleton se-quences are first modeled as trajectories on Kendall's shapespace and then mapped to the linear tangent space. The re-sulting structured data are then fed to a deep learning archi-tecture, which includes a layer that optimizes over rigid andnon rigid transformations of the 3D skeletons, followed bya CNN-LSTM network. The assessment on two large scaleskeleton datasets, namely NTU-RGB+D and NTU-RGB+D120, has proven that the proposed approach outperformsexisting geometric deep learning methods and exceeds re-cently published approaches with respect to the majority of configurations.
Cite
Text
Friji et al. "Geometric Deep Neural Network Using Rigid and Non-Rigid Transformations for Human Action Recognition." International Conference on Computer Vision, 2021. doi:10.1109/ICCV48922.2021.01238Markdown
[Friji et al. "Geometric Deep Neural Network Using Rigid and Non-Rigid Transformations for Human Action Recognition." International Conference on Computer Vision, 2021.](https://mlanthology.org/iccv/2021/friji2021iccv-geometric/) doi:10.1109/ICCV48922.2021.01238BibTeX
@inproceedings{friji2021iccv-geometric,
title = {{Geometric Deep Neural Network Using Rigid and Non-Rigid Transformations for Human Action Recognition}},
author = {Friji, Rasha and Drira, Hassen and Chaieb, Faten and Kchok, Hamza and Kurtek, Sebastian},
booktitle = {International Conference on Computer Vision},
year = {2021},
pages = {12611-12620},
doi = {10.1109/ICCV48922.2021.01238},
url = {https://mlanthology.org/iccv/2021/friji2021iccv-geometric/}
}