Mao, Song

5 publications

ICLR 2026
IWR-Bench: Can LVLMs Reconstruct Interactive Webpage from a User Interaction Video?
Yang Chen, Minghao Liu, Yufan Shen, Yunwen Li, Tianyuan Huang, Xinyu Fang, Tianyu Zheng, Wenxuan Huang, Cheng Yang, Licheng Wen, Xuemeng Yang, Daocheng Fu, Jianbiao Mei, Rong Wu, Song Mao, Qunshu Lin, Zhi Yu, Yongliang Shen, Yu Qiao, Botian Shi

ICLR 2026
Investigating Redundancy in Multimodal Large Language Models with Multiple Vision Encoders
Yizhou Wang, Song Mao, Yang Chen, Yufan Shen, Pinlong Cai, Ding Wang, Guohang Yan, Zhi Yu, Yinqiao Yan, Xuming Hu, Botian Shi

ICCV 2025
Aligning Vision to Language: Annotation-Free Multimodal Knowledge Graph Construction for Enhanced LLMs Reasoning
Junming Liu, Siyuan Meng, Yanting Gao, Song Mao, Pinlong Cai, Guohang Yan, Yirong Chen, Zilin Bian, Ding Wang, Botian Shi

ICCV 2025
Chimera: Improving Generalist Model with Domain-Specific Experts
Tianshuo Peng, Mingsheng Li, Jiakang Yuan, Hongbin Zhou, Renqiu Xia, Renrui Zhang, Lei Bai, Song Mao, Bin Wang, Aojun Zhou, Botian Shi, Tao Chen, Bo Zhang, Xiangyu Yue

CVPR 2007
Combining Static Classifiers and Class Syntax Models for Logical Entity Recognition in Scanned Historical Documents
Song Mao, Praveer Mansukhani, George R. Thoma