Ding, Yuxuan

3 publications

ICLR 2025 TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models Ziyao Shangguan, Chuhan Li, Yuxuan Ding, Yanan Zheng, Yilun Zhao, Tesca Fitzgerald, Arman Cohan
CoRL 2025 TReF-6: Inferring Task-Relevant Frames from a Single Demonstration for One-Shot Skill Generalization Yuxuan Ding, Shuangge Wang, Tesca Fitzgerald
NeurIPS 2023 The CLIP Model Is Secretly an Image-to-Prompt Converter Yuxuan Ding, Chunna Tian, Haoxuan Ding, Lingqiao Liu