Reconstruction of 3D Interaction Models from Images Using Shape Prior

Abstract

We investigate the reconstruction of 3D human-object interactions from images, encompassing 3D human shape and pose estimation as well as object shape and pose estimation. To address this task, we introduce an autoregressive transformer-based variational autoencoder capable of learning a robust shape prior from extensive 3D shape datasets. Additionally, we leverage the reconstructed 3D human body as supplementary features for object shape and pose estimation. In contrast, prior methods only predict object pose and rely on shape templates for shape prediction. Experimental findings on the BEHAVE dataset underscore the effectiveness of our proposed approach, achieving a 40.7cm Chamfer distance and demonstrating the advantages of learning a shape prior.

Cite

Text

Mirmohammadi et al. "Reconstruction of 3D Interaction Models from Images Using Shape Prior." IEEE/CVF International Conference on Computer Vision Workshops, 2023. doi:10.1109/ICCVW60793.2023.00228

Markdown

[Mirmohammadi et al. "Reconstruction of 3D Interaction Models from Images Using Shape Prior." IEEE/CVF International Conference on Computer Vision Workshops, 2023.](https://mlanthology.org/iccvw/2023/mirmohammadi2023iccvw-reconstruction/) doi:10.1109/ICCVW60793.2023.00228

BibTeX

@inproceedings{mirmohammadi2023iccvw-reconstruction,
  title     = {{Reconstruction of 3D Interaction Models from Images Using Shape Prior}},
  author    = {Mirmohammadi, Mehrshad and Saremi, Parham and Kuo, Yen-Ling and Wang, Xi},
  booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
  year      = {2023},
  pages     = {2133-2139},
  doi       = {10.1109/ICCVW60793.2023.00228},
  url       = {https://mlanthology.org/iccvw/2023/mirmohammadi2023iccvw-reconstruction/}
}