Kang, Weitai

6 publications

NeurIPS 2025. InfantAgent-Next: A Multimodal Generalist Agent for Automated Computer Interaction. Bin Lei, Weitai Kang, Zijian Zhang, Winson Chen, Xi Xie, Shan Zuo, Mimi Xie, Ali Payani, Mingyi Hong, Yan Yan, Caiwen Ding
ICLR 2025. Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention. Weitai Kang, Mengxue Qu, Jyoti Kini, Yunchao Wei, Mubarak Shah, Yan Yan
ICCV 2025. Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning. Weitai Kang, Haifeng Huang, Yuzhang Shang, Mubarak Shah, Yan Yan
CVPR 2024. On the Faithfulness of Vision Transformer Explanations. Junyi Wu, Weitai Kang, Hao Tang, Yuan Hong, Yan Yan
ECCV 2024. SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding. Weitai Kang, Gaowen Liu, Mubarak Shah, Yan Yan
CVPR 2024. Token Transformation Matters: Towards Faithful Post-Hoc Explanation for Vision Transformer. Junyi Wu, Bin Duan, Weitai Kang, Hao Tang, Yan Yan