R2-Art: Category-Level Articulation Pose Estimation from Single RGB Image via Cascade Render Strategy
Abstract
Human life is filled with articulated objects. Previous works for estimating the pose of category-level articulated objects rely on costly 3D point clouds or RGB-D images. In this paper, our goal is to estimate category-level articulation poses from a single RGB image, where we propose R2-Art, a novel category-level Articulation pose estimation framework from a single RGB image and a cascade Render strategy. Given an RGB image as input, R2-Art estimates per-part 6D pose for the articulation. Specifically, we design parallel regression branches tailored to generate camera-to-root translation and rotation. Using the predicted joint states, we perform PC prior transformation and deformation with a joint-centric modeling approach. For further refinement, a cascade render strategy is proposed for projecting the 3D deformed prior onto the 2D mask. Extensive experiments are provided to validate our R2-Art on various datasets ranging from synthetic datasets to real-world scenarios, demonstrating the superior performance and robustness of the R2-Art. We believe that this work has the potential to be applied in many fields including robotics, embodied intelligence, and augmented reality.
Cite
Text
Zhang et al. "R2-Art: Category-Level Articulation Pose Estimation from Single RGB Image via Cascade Render Strategy." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I9.33083Markdown
[Zhang et al. "R2-Art: Category-Level Articulation Pose Estimation from Single RGB Image via Cascade Render Strategy." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/zhang2025aaai-r/) doi:10.1609/AAAI.V39I9.33083BibTeX
@inproceedings{zhang2025aaai-r,
title = {{R2-Art: Category-Level Articulation Pose Estimation from Single RGB Image via Cascade Render Strategy}},
author = {Zhang, Li and Jiang, Haonan and Huo, Yukang and Zhong, Yan and Wang, Jianan and Wang, Xue and Wang, Rujing and Liu, Liu},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {9985-9993},
doi = {10.1609/AAAI.V39I9.33083},
url = {https://mlanthology.org/aaai/2025/zhang2025aaai-r/}
}