Cai, Shaofei

16 publications

ICLR 2025 GROOT-2: Weakly Supervised Multimodal Instruction Following Agents Shaofei Cai, Bowei Zhang, Zihao Wang, Haowei Lin, Xiaojian Ma, Anji Liu, Yitao Liang
ICCV 2025 Open-World Skill Discovery from Unsegmented Demonstration Videos Jingwen Deng, Zihao Wang, Shaofei Cai, Anji Liu, Yitao Liang
CVPR 2025 ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting Shaofei Cai, Zihao Wang, Kewei Lian, Zhancun Mu, Xiaojian Ma, Anji Liu, Yitao Liang
ICMLW 2024 GROOT-1.5: Learning to Follow Multi-Modal Instructions from Weak Supervision Shaofei Cai, Bowei Zhang, Zihao Wang, Xiaojian Ma, Anji Liu, Yitao Liang
ICLR 2024 GROOT: Learning to Follow Instructions by Watching Gameplay Videos Shaofei Cai, Bowei Zhang, Zihao Wang, Xiaojian Ma, Anji Liu, Yitao Liang
NeurIPS 2024 OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents Zihao Wang, Shaofei Cai, Zhancun Mu, Haowei Lin, Ceyao Zhang, Xuejie Liu, Qing Li, Anji Liu, Xiaojian Ma, Yitao Liang
ICMLW 2024 OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents Zihao Wang, Shaofei Cai, Zhancun Mu, Haowei Lin, Ceyao Zhang, Xuejie Liu, Qing Li, Anji Liu, Xiaojian Ma, Yitao Liang
NeurIPSW 2024 ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting Shaofei Cai, Zihao Wang, Kewei Lian, Zhancun Mu, Xiaojian Ma, Anji Liu, Yitao Liang
NeurIPS 2023 Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task Agents Zihao Wang, Shaofei Cai, Guanzhou Chen, Anji Liu, Xiaojian Ma, Yitao Liang
WACV 2023 DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editings Bingchuan Li, Shaofei Cai, Wei Liu, Peng Zhang, Qian He, Miao Hua, Zili Yi
NeurIPSW 2023 GROOT: Learning to Follow Instructions by Watching Gameplay Videos Shaofei Cai, Bowei Zhang, Zihao Wang, Xiaojian Ma, Anji Liu, Yitao Liang
NeurIPSW 2023 GROOT: Learning to Follow Instructions by Watching Gameplay Videos Shaofei Cai, Bowei Zhang, Zihao Wang, Xiaojian Ma, Anji Liu, Yitao Liang
NeurIPSW 2023 JARVIS-1: Open-World Multi-Task Agents with Memory-Augmented Multimodal Language Models Zihao Wang, Shaofei Cai, Anji Liu, Xiaojian Ma, Yitao Liang
CVPR 2023 Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction Shaofei Cai, Zihao Wang, Xiaojian Ma, Anji Liu, Yitao Liang
CVPR 2022 Automatic Relation-Aware Graph Network Proliferation Shaofei Cai, Liang Li, Xinzhe Han, Jiebo Luo, Zheng-Jun Zha, Qingming Huang
CVPR 2021 Rethinking Graph Neural Architecture Search from Message-Passing Shaofei Cai, Liang Li, Jincan Deng, Beichen Zhang, Zheng-Jun Zha, Li Su, Qingming Huang