Chen, Zhenfang

27 publications

ICML 2025 Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search Maohao Shen, Guangtao Zeng, Zhenting Qi, Zhang-Wei Hong, Zhenfang Chen, Wei Lu, Gregory W. Wornell, Subhro Das, David Daniel Cox, Chuang Gan
ICLR 2025 Scaling Autonomous Agents via Automatic Reward Modeling and Planning Zhenfang Chen, Delin Chen, Rui Sun, Wenjun Liu, Chuang Gan
CVPR 2025 Scene-Agnostic Pose Regression for Visual Localization Junwei Zheng, Ruiping Liu, Yufan Chen, Zhenfang Chen, Kailun Yang, Jiaming Zhang, Rainer Stiefelhagen
ICML 2025 Visual and Domain Knowledge for Professional-Level Graph-of-Thought Medical Reasoning Rina Bao, Shilong Dong, Zhenfang Chen, Sheng He, Ellen Grant, Yangming Ou
ICLR 2024 CoVLM: Composing Visual Entities and Relationships in Large Language Models via Communicative Decoding Junyan Li, Delin Chen, Yining Hong, Zhenfang Chen, Peihao Chen, Yikang Shen, Chuang Gan
ICML 2024 ContPhy: Continuum Physical Concept Learning and Reasoning from Videos Zhicheng Zheng, Xin Yan, Zhenfang Chen, Jingzhou Wang, Qin Zhi Eddie Lim, Joshua B. Tenenbaum, Chuang Gan
ECCV 2024 FlexAttention for Efficient High-Resolution Vision-Language Models Junyan Li, Delin Chen, Tianle Cai, Peihao Chen, Yining Hong, Zhenfang Chen, Yikang Shen, Chuang Gan
ICLR 2024 GENOME: Generative Neuro-Symbolic Visual Reasoning by Growing and Reusing Modules Zhenfang Chen, Rui Sun, Wenjun Liu, Yining Hong, Chuang Gan
ICLR 2024 SALMON: Self-Alignment with Instructable Reward Models Zhiqing Sun, Yikang Shen, Hongxin Zhang, Qinhong Zhou, Zhenfang Chen, David Daniel Cox, Yiming Yang, Chuang Gan
CVPR 2024 SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge Andong Wang, Bo Wu, Sunli Chen, Zhenfang Chen, Haotian Guan, Wei-Ning Lee, Li Erran Li, Chuang Gan
AAAI 2024 Visual Chain-of-Thought Prompting for Knowledge-Based Visual Reasoning Zhenfang Chen, Qinhong Zhou, Yikang Shen, Yining Hong, Zhiqing Sun, Dan Gutfreund, Chuang Gan
CVPR 2023 3D Concept Learning and Reasoning from Multi-View Images Yining Hong, Chunru Lin, Yilun Du, Zhenfang Chen, Joshua B. Tenenbaum, Chuang Gan
NeurIPS 2023 3D-LLM: Injecting the 3D World into Large Language Models Yining Hong, Haoyu Zhen, Peihao Chen, Shuhong Zheng, Yilun Du, Zhenfang Chen, Chuang Gan
CVPR 2023 Mod-SQuAD: Designing Mixtures of Experts as Modular Multi-Task Learners Zitian Chen, Yikang Shen, Mingyu Ding, Zhenfang Chen, Hengshuang Zhao, Erik G. Learned-Miller, Chuang Gan
NeurIPS 2023 Physion++: Evaluating Physical Scene Understanding That Requires Online Inference of Different Physical Properties Hsiao-Yu Tung, Mingyu Ding, Zhenfang Chen, Daniel Bear, Chuang Gan, Josh Tenenbaum, Dan Yamins, Judith Fan, Kevin Smith
ICLR 2023 Planning with Large Language Models for Code Generation Shun Zhang, Zhenfang Chen, Yikang Shen, Mingyu Ding, Joshua B. Tenenbaum, Chuang Gan
NeurIPS 2023 Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision Zhiqing Sun, Yikang Shen, Qinhong Zhou, Hongxin Zhang, Zhenfang Chen, David Cox, Yiming Yang, Chuang Gan
ICCV 2023 TextPSG: Panoptic Scene Graph Generation from Textual Descriptions Chengyang Zhao, Yikang Shen, Zhenfang Chen, Mingyu Ding, Chuang Gan
CVPR 2023 Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention Mingyu Ding, Yikang Shen, Lijie Fan, Zhenfang Chen, Zitian Chen, Ping Luo, Joshua B. Tenenbaum, Chuang Gan
ICLR 2022 ComPhy: Compositional Physical Reasoning of Objects and Events from Videos Zhenfang Chen, Kexin Yi, Yunzhu Li, Mingyu Ding, Antonio Torralba, Joshua B. Tenenbaum, Chuang Gan
CoRL 2022 Embodied Concept Learner: Self-Supervised Learning of Concepts and Mapping Through Instruction Following Mingyu Ding, Yan Xu, Zhenfang Chen, David Daniel Cox, Ping Luo, Joshua B. Tenenbaum, Chuang Gan
ECCV 2022 PS-NeRF: Neural Inverse Rendering for Multi-View Photometric Stereo Wenqi Yang, Guanying Chen, Chaofeng Chen, Zhenfang Chen, Kwan-Yee K. Wong
NeurIPSW 2022 Planning with Large Language Models for Code Generation Shun Zhang, Zhenfang Chen, Yikang Shen, Mingyu Ding, Joshua B. Tenenbaum, Chuang Gan
NeurIPS 2022 S$^3$-NeRF: Neural Reflectance Field from Shading and Shadow Under a Single Viewpoint Wenqi Yang, Guanying Chen, Chaofeng Chen, Zhenfang Chen, Kwan-Yee K. Wong
NeurIPS 2021 Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language Mingyu Ding, Zhenfang Chen, Tao Du, Ping Luo, Josh Tenenbaum, Chuang Gan
ICLR 2021 Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning Zhenfang Chen, Jiayuan Mao, Jiajun Wu, Kwan-Yee Kenneth Wong, Joshua B. Tenenbaum, Chuang Gan
CVPR 2021 The Blessings of Unlabeled Background in Untrimmed Videos Yuan Liu, Jingyuan Chen, Zhenfang Chen, Bing Deng, Jianqiang Huang, Hanwang Zhang