Zhao, Wentian

12 publications

ICLR 2026 HiPRAG: Hierarchical Process Rewards for Efficient Agentic Retrieval Augmented Generation Peilin Wu, Mian Zhang, Kun Wan, Wentian Zhao, Kaiyu He, Xinya Du, Zhiyu Chen
ICLR 2026 Vision-Zero: Scalable VLM Self-Evolution via Multi-Agent Self-Play Qinsi Wang, Bo Liu, Tianyi Zhou, Jing Shi, Yueqian Lin, Yiran Chen, Hai Helen Li, Kun Wan, Wentian Zhao
NeurIPS 2025 Understanding and Mitigating Numerical Sources of Nondeterminism in LLM Inference Jiayi Yuan, Hao Li, Xinheng Ding, Wenya Xie, Yu-Jhe Li, Wentian Zhao, Kun Wan, Jing Shi, Xia Hu, Zirui Liu
CVPR 2024 DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-Based 3D Vision Lu Ling, Yichen Sheng, Zhi Tu, Wentian Zhao, Cheng Xin, Kun Wan, Lantao Yu, Qianyu Guo, Zixun Yu, Yawen Lu, Xuanmao Li, Xingpeng Sun, Rohan Ashok, Aniruddha Mukherjee, Hao Kang, Xiangrui Kong, Gang Hua, Tianyi Zhang, Bedrich Benes, Aniket Bera
AAAI 2024 Relational Distant Supervision for Image Captioning Without Image-Text Pairs Yayun Qi, Wentian Zhao, Xinxiao Wu
WACV 2021 How to Make a BLT Sandwich? Learning VQA Towards Understanding Web Instructional Videos Shaojie Wang, Wentian Zhao, Ziyi Kou, Jing Shi, Chenliang Xu
WACV 2021 Improve CAM with Auto-Adapted Segmentation and Co-Supervised Augmentation Ziyi Kou, Guofeng Cui, Shaojie Wang, Wentian Zhao, Chenliang Xu
NeurIPS 2021 Multi-Modal Dependency Tree for Video Captioning Wentian Zhao, Xinxiao Wu, Jiebo Luo
AAAI 2020 MemCap: Memorizing Style Knowledge for Image Captioning Wentian Zhao, Xinxiao Wu, Xiaoxun Zhang
IJCAI 2020 Video Question Answering on Screencast Tutorials Wentian Zhao, Seokhwan Kim, Ning Xu, Hailin Jin
IJCAI 2019 GAN-EM: GAN Based EM Learning Framework Wentian Zhao, Shaojie Wang, Zhihuai Xie, Jing Shi, Chenliang Xu
ICCV 2019 Joint Syntax Representation Learning and Visual Cue Translation for Video Captioning Jingyi Hou, Xinxiao Wu, Wentian Zhao, Jiebo Luo, Yunde Jia