4DComplete: Non-Rigid Motion Estimation Beyond the Observable Surface

Abstract

Tracking non-rigidly deforming scenes using range sensors has numerous applications in computer vision, AR/VR, and robotics. However, due to occlusions and the physical limitations of range sensors, existing methods handle only the visible surface, which causes discontinuities and incompleteness in the motion field. To address this, we introduce 4DComplete, a novel data-driven approach that estimates the non-rigid motion of the unobserved geometry. 4DComplete takes as input a partial shape and motion observation, extracts a 4D time-space embedding, and jointly infers the missing geometry and motion field using a sparse fully-convolutional network. For network training, we constructed a large-scale synthetic dataset called DeformingThings4D, which consists of 1,972 animation sequences spanning 31 different animal and humanoid categories with dense 4D annotation. Experiments show that 4DComplete (1) reconstructs a high-resolution volumetric shape and motion field from a partial observation, (2) learns an entangled 4D feature representation that benefits both shape and motion estimation, (3) yields more accurate and natural deformation than classic non-rigid priors such as As-Rigid-As-Possible (ARAP) deformation, and (4) generalizes well to unseen objects in real-world sequences.

Cite

Text

Li et al. "4DComplete: Non-Rigid Motion Estimation Beyond the Observable Surface." International Conference on Computer Vision, 2021. doi:10.1109/ICCV48922.2021.01247

Markdown

[Li et al. "4DComplete: Non-Rigid Motion Estimation Beyond the Observable Surface." International Conference on Computer Vision, 2021.](https://mlanthology.org/iccv/2021/li2021iccv-4dcomplete/) doi:10.1109/ICCV48922.2021.01247

BibTeX

@inproceedings{li2021iccv-4dcomplete,
  title     = {{4DComplete: Non-Rigid Motion Estimation Beyond the Observable Surface}},
  author    = {Li, Yang and Takehara, Hikari and Taketomi, Takafumi and Zheng, Bo and Nießner, Matthias},
  booktitle = {International Conference on Computer Vision},
  year      = {2021},
  pages     = {12706--12716},
  doi       = {10.1109/ICCV48922.2021.01247},
  url       = {https://mlanthology.org/iccv/2021/li2021iccv-4dcomplete/}
}