Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation
Abstract
In this paper we present a novel approach for bottom-up multi-person 3D human pose estimation from monocular RGB images. We propose to use high resolution volumetric heatmaps to model joint locations, devising a simple and effective compression method to drastically reduce the size of this representation. At the core of the proposed method lies our Volumetric Heatmap Autoencoder, a fully-convolutional network tasked with the compression of ground-truth heatmaps into a dense intermediate representation. A second model, the Code Predictor, is then trained to predict these codes, which can be decompressed at test time to re-obtain the original representation. Our experimental evaluation shows that our method performs favorably when compared to state of the art on both multi-person and single-person 3D human pose estimation datasets and, thanks to our novel compression strategy, can process full-HD images at the constant runtime of 8 fps regardless of the number of subjects in the scene. Code and models are publicly available.
Cite
Text
Fabbri et al. "Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.00723Markdown
[Fabbri et al. "Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/fabbri2020cvpr-compressed/) doi:10.1109/CVPR42600.2020.00723BibTeX
@inproceedings{fabbri2020cvpr-compressed,
title = {{Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation}},
author = {Fabbri, Matteo and Lanzi, Fabio and Calderara, Simone and Alletto, Stefano and Cucchiara, Rita},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2020},
doi = {10.1109/CVPR42600.2020.00723},
url = {https://mlanthology.org/cvpr/2020/fabbri2020cvpr-compressed/}
}