Xu, Weikai
2 publications
ICLR
2026
SMAN-Bench: A Cross-System Benchmark for Mobile Agents Under Single- and Multi-Path, Ambiguous, and Noisy Tasks
Weikai Xu, Zhizheng Jiang, Yuxuan Liu, Pengzhi Gao, Wei Liu, Jian Luan, Yunxin Liu, Yuanchun Li, Bin Wang, Bo An