Three-Stream Convolutional Neural Network with Multi-Task and Ensemble Learning for 3D Action Recognition

Abstract

In this paper, we propose a three-stream convolutional neural network (3SCNN) for action recognition from skeleton sequences, which aims to thoroughly and fully exploit the skeleton data by extracting, learning, fusing and inferring multiple motion-related features, including 3D joint positions and joint displacements across adjacent frames as well as oriented bone segments. The proposed 3SCNN involves three sequential stages. The first stage enriches three independently extracted features by co-occurrence feature learning. The second stage involves multi-channel pairwise fusion to take advantage of the complementary and diverse nature among three features. The third stage is a multi-task and ensemble learning network to further improve the generalization ability of 3SCNN. Experimental results on the standard dataset show the effectiveness of our proposed multi-stream feature learning, fusion and inference method for skeleton-based 3D action recognition.

Cite

Text

Liang et al. "Three-Stream Convolutional Neural Network with Multi-Task and Ensemble Learning for 3D Action Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2019. doi:10.1109/CVPRW.2019.00123

Markdown

[Liang et al. "Three-Stream Convolutional Neural Network with Multi-Task and Ensemble Learning for 3D Action Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2019.](https://mlanthology.org/cvprw/2019/liang2019cvprw-threestream/) doi:10.1109/CVPRW.2019.00123

BibTeX

@inproceedings{liang2019cvprw-threestream,
  title     = {{Three-Stream Convolutional Neural Network with Multi-Task and Ensemble Learning for 3D Action Recognition}},
  author    = {Liang, Duohan and Fan, Guoliang and Lin, Guangfeng and Chen, Wanjun and Pan, Xiaorong and Zhu, Hong},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2019},
  pages     = {934-940},
  doi       = {10.1109/CVPRW.2019.00123},
  url       = {https://mlanthology.org/cvprw/2019/liang2019cvprw-threestream/}
}