D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction
Abstract
Reconstructing hand-held objects from a single RGB image is a challenging task in computer vision. In contrast to prior works that utilize deterministic modeling paradigms, we employ a point cloud denoising diffusion model to account for the probabilistic nature of this problem. In the core, we introduce centroid-fixed Dual-Stream Conditional diffusion for monocular hand-held object reconstruction (), tackling two predominant challenges. First, to avoid the object centroid from deviating, we utilize a novel hand-constrained centroid fixing paradigm, enhancing the stability of diffusion and reverse processes and the precision of feature projection. Second, we introduce a dual-stream denoiser to semantically and geometrically model hand-object interactions with a novel unified hand-object semantic embedding, enhancing the reconstruction performance of the hand-occluded region of the object. Experiments on the synthetic ObMan dataset and three real-world datasets HO3D, MOW and DexYCB demonstrate that our approach can surpass all other state-of-the-art methods.
Cite
Text
Fu et al. "D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73397-0_22Markdown
[Fu et al. "D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/fu2024eccv-dsco/) doi:10.1007/978-3-031-73397-0_22BibTeX
@inproceedings{fu2024eccv-dsco,
title = {{D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction}},
author = {Fu, Bowen and Wang, Gu and Zhang, Chenyangguang and Di, Yan and Huang, Ziqin and Leng, Zhiying and Manhardt, Fabian and Ji, Xiangyang and Tombari, Federico},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-73397-0_22},
url = {https://mlanthology.org/eccv/2024/fu2024eccv-dsco/}
}