Wang, Zekun

14 publications

ICLR 2025. "AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials." Yiheng Xu, Dunjie Lu, Zhennan Shen, Junli Wang, Zekun Wang, Yuchen Mao, Caiming Xiong, Tao Yu
ICML 2025. "Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction." Yiheng Xu, Zekun Wang, Junli Wang, Dunjie Lu, Tianbao Xie, Amrita Saha, Doyen Sahoo, Tao Yu, Caiming Xiong
ICLRW 2025. "Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction." Yiheng Xu, Zekun Wang, Junli Wang, Dunjie Lu, Tianbao Xie, Amrita Saha, Doyen Sahoo, Tao Yu, Caiming Xiong
NeurIPS 2025. "Deep Taxonomic Networks for Unsupervised Hierarchical Prototype Discovery." Zekun Wang, Ethan Haarer, Tianyi Zhu, Zhiyi Dai, Christopher J. MacLellan
NeurIPS 2025. "Gated Attention for Large Language Models: Non-Linearity, Sparsity, and Attention-Sink-Free." Zihan Qiu, Zekun Wang, Bo Zheng, Zeyu Huang, Kaiyue Wen, Songlin Yang, Rui Men, Le Yu, Fei Huang, Suozhi Huang, Dayiheng Liu, Jingren Zhou, Junyang Lin
ICLR 2025. "Improved Diffusion-Based Generative Model with Better Adversarial Robustness." Zekun Wang, Mingyang Yi, Shuchen Xue, Zhenguo Li, Ming Liu, Bing Qin, Zhi-Ming Ma
NeurIPS 2025. "Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis." Tianbao Xie, Jiaqi Deng, Xiaochuan Li, Junlin Yang, Haoyuan Wu, Jixuan Chen, Wenjing Hu, Xinyuan Wang, Yuhui Xu, Zekun Wang, Yiheng Xu, Junli Wang, Doyen Sahoo, Tao Yu, Caiming Xiong
ICMLW 2024. "Babysit a Language Model from Scratch: Interactive Language Learning by Trials and Demonstrations." Ziqiao Ma, Zekun Wang, Joyce Chai
NeurIPS 2024. "Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation." Jingchang Chen, Hongxuan Tang, Zheng Chu, Qianglong Chen, Zekun Wang, Ming Liu, Bing Qin
IJCAI 2024. "GUIDE: A Guideline-Guided Dataset for Instructional Video Comprehension." Jiafeng Liang, Shixin Jiang, Zekun Wang, Haojie Pan, Zerui Chen, Zheng Chu, Ming Liu, Ruiji Fu, Zhongyuan Wang, Bing Qin
NeurIPS 2024. "II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models." Ziqiang Liu, Feiteng Fang, Xi Feng, Xinrun Du, Chenhao Zhang, Zekun Wang, Yuelin Bai, Qixuan Zhao, Liyang Fan, Chengguang Gan, Hongquan Lin, Jiaming Li, Yuansheng Ni, Haihong Wu, Yaswanth Narsupalli, Zhigang Zheng, Chengming Li, Xiping Hu, Ruifeng Xu, Xiaojun Chen, Min Yang, Jiaheng Liu, Ruibo Liu, Wenhao Huang, Ge Zhang, Shiwen Ni
NeurIPS 2024. "RoleAgent: Building, Interacting, and Benchmarking High-Quality Role-Playing Agents from Scripts." Jiaheng Liu, Zehao Ni, Haoran Que, Tao Sun, Zekun Wang, Jian Yang, Jiakai Wang, Hongcheng Guo, Zhongyuan Peng, Ge Zhang, Jiayi Tian, Xingyuan Bu, Ke Xu, Wenge Rong, Junran Peng, Zhaoxiang Zhang
CVPRW 2023. "Digital Twin Tracking Dataset (DTTD): A New RGB+Depth 3D Dataset for Longer-Range Object Tracking Applications." Weiyu Feng, Seth Z. Zhao, Chuanyu Pan, Adam Chang, Yichen Chen, Zekun Wang, Allen Y. Yang
IJCAI 2023. "GTR: A Grafting-Then-Reassembling Framework for Dynamic Scene Graph Generation." Jiafeng Liang, Yuxin Wang, Zekun Wang, Ming Liu, Ruiji Fu, Zhongyuan Wang, Bing Qin