Song, Zhinan

1 publication

ICLR 2025 Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions? Boshen Xu, Ziheng Wang, Yang Du, Zhinan Song, Sipeng Zheng, Qin Jin