AffordDexGrasp: Open-Set Language-Guided Dexterous Grasp with Generalizable-Instructive Affordance
Abstract
Language-guided robot dexterous generation enables robots to grasp and manipulate objects based on human commands. However, previous data-driven methods are hard to understand intention and execute grasping with unseen categories in the open set. In this work, we explore a new task, Open-set Language-guided Dexterous Grasp, and find that the main challenge is the huge gap between high-level human language semantics and low-level robot action. To solve this problem, we propose an Affordance Dexterous Grasp (AffordDexGrasp) framework, with the insight that bridging the gap with a new generalizable-instructive affordance representation. This affordance can generalize to unseen categories by leveraging the object's local structure and category-agnostic semantic attributes, thereby effectively guiding dexterous grasp generation. Built upon the affordance, our framework introduces Affordance Flow Matching (AFM) for affordance generation with language as input, and Grasp Flow Matching (GFM) for generating dexterous grasp with affordance as input. To evaluate our framework, we build an open-set table-top language-guided dexterous grasp dataset. Extensive experiments in the simulation and real worlds show that our framework surpasses all previous methods in both seen category and unseen category generalization.
Cite
Text
Wei et al. "AffordDexGrasp: Open-Set Language-Guided Dexterous Grasp with Generalizable-Instructive Affordance." International Conference on Computer Vision, 2025.Markdown
[Wei et al. "AffordDexGrasp: Open-Set Language-Guided Dexterous Grasp with Generalizable-Instructive Affordance." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/wei2025iccv-afforddexgrasp/)BibTeX
@inproceedings{wei2025iccv-afforddexgrasp,
title = {{AffordDexGrasp: Open-Set Language-Guided Dexterous Grasp with Generalizable-Instructive Affordance}},
author = {Wei, Yi-Lin and Lin, Mu and Lin, Yuhao and Jiang, Jian-Jian and Wu, Xiao-Ming and Zeng, Ling-An and Zheng, Wei-Shi},
booktitle = {International Conference on Computer Vision},
year = {2025},
pages = {11818-11828},
url = {https://mlanthology.org/iccv/2025/wei2025iccv-afforddexgrasp/}
}