Xu, Zhiwei

27 publications

ICLR 2026 Peak-Return Greedy Slicing: Subtrajectory Selection for Transformer-Based Offline RL Zhiwei Xu, Miduo Cui, Dapeng Li, Zhihao Liu, Haifeng Zhang, Hangyu Mao, Guoliang Fan, Bin Zhang
ICLR 2026 Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces Mike A Merrill, Alexander Glenn Shaw, Nicholas Carlini, Boxuan Li, Harsh Raj, Ivan Bercovich, Lin Shi, Jeong Yeon Shin, Thomas Walshe, E. Kelly Buchanan, Junhong Shen, Guanghao Ye, Haowei Lin, Jason Poulos, Maoyu Wang, Marianna Nezhurina, Di Lu, Orfeas Menis Mastromichalakis, Zhiwei Xu, Zizhao Chen, Yue Liu, Robert Zhang, Leon Liangyu Chen, Anurag Kashyap, Jan-Lucas Uslu, Jeffrey Li, Jianbo Wu, Minghao Yan, Song Bian, Vedang Sharma, Ke Sun, Steven Dillmann, Akshay Anand, Andrew Lanpouthakoun, Bardia Koopah, Changran Hu, Etash Kumar Guha, Gabriel H. S. Dreiman, Jiacheng Zhu, Karl Krauth, Li Zhong, Niklas Muennighoff, Robert Kwesi Amanfu, Shangyin Tan, Shreyas Pimpalgaonkar, Tushar Aggarwal, Xiangning Lin, Xin Lan, Xuandong Zhao, Yiqing Liang, Yuanli Wang, Zilong Wang, Changzhi Zhou, David Heineman, Hange Liu, Harsh Trivedi, John Yang, Junhong Lin, Manish Shetty, Michael Yang, Nabil Omi, Negin Raoof, Shanda Li, Terry Yue Zhuo, Wuwei Lin, Yiwei Dai, Yuxin Wang, Wenhao Chai, Shang Zhou, Dariush Wahdany, Ziyu She, Jiaming Hu, Zhikang Dong, Yuxuan Zhu, Sasha Cui, Ahson Saiyed, Arinbjörn Kolbeinsson, Christopher Michael Rytting, Ryan Marten, Yixin Wang, Jenia Jitsev, Alex Dimakis, Andy Konwinski, Ludwig Schmidt
NeurIPS 2025 Belief-Calibrated Multi-Agent Consensus Seeking for Complex NLP Tasks Wentao Deng, Jiahuan Pei, Zhiwei Xu, Zhaochun Ren, Zhumin Chen, Pengjie Ren
NeurIPS 2025 Benign Overfitting in Single-Head Attention Roey Magen, Shuning Shang, Zhiwei Xu, Spencer Frei, Wei Hu, Gal Vardi
ICCV 2025 DAA*: Deep Angular A Star for Image-Based Path Planning Zhiwei Xu
AAAI 2025 Efficient Communication in Multi-Agent Reinforcement Learning with Implicit Consensus Generation Dapeng Li, Na Lou, Zhiwei Xu, Bin Zhang, Guoliang Fan
AAAI 2025 Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition Changwei Wang, Shunpeng Chen, Yukun Song, Rongtao Xu, Zherui Zhang, Jiguang Zhang, Haoran Yang, Yu Zhang, Kexue Fu, Shide Du, Zhiwei Xu, Longxiang Gao, Li Guo, Shibiao Xu
ICLR 2025 Let Me Grok for You: Accelerating Grokking via Embedding Transfer from a Weaker Model Zhiwei Xu, Zhiyu Ni, Yixin Wang, Wei Hu
ICML 2025 Reidentify: Context-Aware Identity Generation for Contextual Multi-Agent Reinforcement Learning Zhiwei Xu, Kun Hu, Xin Xin, Weiliang Meng, Yiwei Shi, Hangyu Mao, Bin Zhang, Dapeng Li, Jiangjin Yin
AAAI 2024 Adversarial Purification with the Manifold Hypothesis Zhaoyuan Yang, Zhiwei Xu, Jing Zhang, Richard I. Hartley, Peter H. Tu
ICLR 2024 Benign Overfitting and Grokking in ReLU Networks for XOR Cluster Data Zhiwei Xu, Yutong Wang, Spencer Frei, Gal Vardi, Wei Hu
NeurIPSW 2024 Benign Overfitting in Single-Head Attention Roey Magen, Shuning Shang, Zhiwei Xu, Spencer Frei, Wei Hu, Gal Vardi
ICLRW 2024 Controlling Large Language Model-Based Agents for Large-Scale Decision-Making: An Actor-Critic Approach Bin Zhang, Hangyu Mao, Jingqing Ruan, Ying Wen, Yang Li, Shao Zhang, Zhiwei Xu, Dapeng Li, Ziyue Li, Rui Zhao, Guoliang Fan, Lijuan Li
ICLR 2024 IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models Zhaoyuan Yang, Zhengyang Yu, Zhiwei Xu, Jaskirat Singh, Jing Zhang, Dylan Campbell, Peter Tu, Richard Hartley
ICML 2024 Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, Guoliang Fan
ACML 2024 Vision Transformer with High Spatial Structure Sensitivity Zhiwei Xu
NeurIPSW 2023 Benign Overfitting and Grokking in ReLU Networks for XOR Cluster Data Zhiwei Xu, Yutong Wang, Spencer Frei, Gal Vardi, Wei Hu
AAAI 2023 Consensus Learning for Cooperative Multi-Agent Reinforcement Learning Zhiwei Xu, Bin Zhang, Dapeng Li, Zeren Zhang, Guangchong Zhou, Hao Chen, Guoliang Fan
NeurIPS 2023 Dual Self-Awareness Value Decomposition Framework Without Individual Global Max for Cooperative MARL Zhiwei Xu, Bin Zhang, Dapeng Li, Guangchong Zhou, Zeren Zhang, Guoliang Fan
AAAI 2023 HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism Zhiwei Xu, Yunpeng Bai, Bin Zhang, Dapeng Li, Guoliang Fan
IJCAI 2023 Inducing Stackelberg Equilibrium Through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning Bin Zhang, Lijuan Li, Zhiwei Xu, Dapeng Li, Guoliang Fan
ICMLW 2023 PMaF: Deep Declarative Layers for Principal Matrix Features Zhiwei Xu, Hao Wang, Yanbin Liu, Stephen Gould
AAAI 2023 Rethinking Label Refurbishment: Model Robustness Under Label Noise Yangdi Lu, Zhiwei Xu, Wenbo He
NeurIPSW 2023 TPTU: Task Planning and Tool Usage of Large Language Model-Based AI Agents Jingqing Ruan, YiHong Chen, Bin Zhang, Zhiwei Xu, Tianpeng Bao, Du Guo Qing, Shi Shiwei, Hangyu Mao, Ziyue Li, Xingyu Zeng, Rui Zhao
ICMLW 2023 Towards Understanding Gradient Approximation in Equality Constrained Deep Declarative Networks Stephen Gould, Ming Xu, Zhiwei Xu, Yanbin Liu
NeurIPS 2022 Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning Zhiwei Xu, Dapeng Li, Bin Zhang, Yuan Zhan, Yunpeng Bai, Guoliang Fan
IJCAI 2011 Effective and Efficient Microprocessor Design Space Exploration Using Unlabeled Design Configurations Qi Guo, Tianshi Chen, Yunji Chen, Zhi-Hua Zhou, Weiwu Hu, Zhiwei Xu