EgoHandICL: Egocentric 3D Hand Reconstruction with In-Context Learning

Xie, Binzhu; Qiu, Shi; Zhang, Sicheng; Wang, Yinqiao; Xu, Hao; Naseer, Muzammal; Fu, Chi-Wing; Heng, Pheng-Ann

EgoHandICL: Egocentric 3D Hand Reconstruction with In-Context Learning

Binzhu Xie, Shi Qiu, Sicheng Zhang, Yinqiao Wang, Hao Xu, Muzammal Naseer, Chi-Wing Fu, Pheng-Ann Heng

ICLR 2026

/iclr/2026/xie2026iclr-egohandicl/

Abstract

Robust 3D hand reconstruction is challenging in egocentric vision due to depth ambiguity, self-occlusion, and complex hand-object interactions. Prior works attempt to mitigate the challenges by scaling up training data or incorporating auxiliary cues, often falling short of effectively handling unseen contexts. In this paper, we introduce EgoHandICL, the first in-context learning (ICL) framework for 3D hand reconstruction that achieves strong semantic alignment, visual consistency, and robustness under challenging egocentric conditions. Specifically, we develop (i) complementary exemplar retrieval strategies guided by vision–language models (VLMs), (ii) an ICL-tailored tokenizer that integrates multimodal context, and (iii) a Masked Autoencoders (MAE)-based architecture trained with 3D hand–guided geometric and perceptual objectives. By conducting comprehensive experiments on the ARCTIC and EgoExo4D benchmarks, our EgoHandICL consistently demonstrates significant improvements over state-of-the-art 3D hand reconstruction methods. We further show EgoHandICL’s applicability by testing it on real-world egocentric cases and integrating it with EgoVLMs to enhance their hand–object interaction reasoning. Our code and data will be publicly available.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Xie et al. "EgoHandICL: Egocentric 3D Hand Reconstruction with In-Context Learning." International Conference on Learning Representations, 2026.

Markdown

[Xie et al. "EgoHandICL: Egocentric 3D Hand Reconstruction with In-Context Learning." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/xie2026iclr-egohandicl/)

BibTeX

@inproceedings{xie2026iclr-egohandicl,
  title     = {{EgoHandICL: Egocentric 3D Hand Reconstruction with In-Context Learning}},
  author    = {Xie, Binzhu and Qiu, Shi and Zhang, Sicheng and Wang, Yinqiao and Xu, Hao and Naseer, Muzammal and Fu, Chi-Wing and Heng, Pheng-Ann},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/xie2026iclr-egohandicl/}
}