MHEntropy: Entropy Meets Multiple Hypotheses for Pose and Shape Recovery
Abstract
For monocular RGB-based 3D pose and shape estimation, multiple solutions are often feasible due to factors like occlusion and truncation. This work presents a multi-hypothesis probabilistic framework by optimizing the Kullback-Leibler divergence (KLD) between the data and model distribution. Our formulation reveals a connection between the pose entropy and diversity in the multiple hypotheses that has been neglected by previous works. For a comprehensive evaluation, besides the best hypothesis (BH) metric, we factor in visibility for evaluating diversity. Additionally, our framework is label-friendly, in that it can be learned from only partial 2D keypoints, e.g., those that are visible. Experiments on both ambiguous and real-world benchmarks demonstrate that our method outperforms other state-of-the-art multi-hypothesis methods in a comprehensive evaluation. The project page is at https://gloryyrolg.github.io/MHEntropy.
Cite
Text
Chen et al. "MHEntropy: Entropy Meets Multiple Hypotheses for Pose and Shape Recovery." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.01363Markdown
[Chen et al. "MHEntropy: Entropy Meets Multiple Hypotheses for Pose and Shape Recovery." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/chen2023iccv-mhentropy/) doi:10.1109/ICCV51070.2023.01363BibTeX
@inproceedings{chen2023iccv-mhentropy,
title = {{MHEntropy: Entropy Meets Multiple Hypotheses for Pose and Shape Recovery}},
author = {Chen, Rongyu and Yang, Linlin and Yao, Angela},
booktitle = {International Conference on Computer Vision},
year = {2023},
pages = {14840-14849},
doi = {10.1109/ICCV51070.2023.01363},
url = {https://mlanthology.org/iccv/2023/chen2023iccv-mhentropy/}
}