Yu, Zhi

7 publications

ICLR 2026 IWR-Bench: Can LVLMs Reconstruct Interactive Webpage from a User Interaction Video? Yang Chen, Minghao Liu, Yufan Shen, Yunwen Li, Tianyuan Huang, Xinyu Fang, Tianyu Zheng, Wenxuan Huang, Cheng Yang, Licheng Wen, Xuemeng Yang, Daocheng Fu, Jianbiao Mei, Rong Wu, Song Mao, Qunshu Lin, Zhi Yu, Yongliang Shen, Yu Qiao, Botian Shi
ICLR 2026 Investigating Redundancy in Multimodal Large Language Models with Multiple Vision Encoders Yizhou Wang, Song Mao, Yang Chen, Yufan Shen, Pinlong Cai, Ding Wang, Guohang Yan, Zhi Yu, Yinqiao Yan, Xuming Hu, Botian Shi
AAAI 2025 ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data Yufan Shen, Chuwei Luo, Zhaoqing Zhu, Yang Chen, Qi Zheng, Zhi Yu, Jiajun Bu, Cong Yao
CVPR 2024 LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding Chuwei Luo, Yufan Shen, Zhaoqing Zhu, Qi Zheng, Zhi Yu, Cong Yao
ECCV 2024 WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation Zirui Shao, Feiyu Gao, Hangdi Xing, Zepeng Zhu, Zhi Yu, Jiajun Bu, Qi Zheng, Cong Yao
AAAI 2023 LORE: Logical Location Regression Network for Table Structure Recognition Hangdi Xing, Feiyu Gao, Rujiao Long, Jiajun Bu, Qi Zheng, Liangcheng Li, Cong Yao, Zhi Yu
ECCV 2020 An End-to-End OCR Text Re-Organization Sequence Learning for Rich-Text Detail Image Comprehension Liangcheng Li, Feiyu Gao, Jiajun Bu, Yongpan Wang, Zhi Yu, Qi Zheng