Liu, Yuliang

33 publications

ICML 2025 AdaptiveStep: Automatically Dividing Reasoning Step Through Model Confidence Yuliang Liu, Junjie Lu, Chaofeng Qu, Zhaoling Chen, Zefan Cai, Jason Klein Liu, Chonghan Liu, Yunhui Xia, Li Zhao, Jiang Bian, Chuheng Zhang, Wei Shen, Zhouhan Lin
ICCV 2025 DocThinker: Explainable Multimodal Large Language Models with Rule-Based Reinforcement Learning for Document Understanding Wenwen Yu, Zhibo Yang, Yuliang Liu, Xiang Bai
ICCV 2025 LIRA: Inferring Segmentation in Large Multi-Modal Models with Local Interleaved Region Assistance Zhang Li, Biao Yang, Qiang Liu, Shuo Zhang, Zhiyin Ma, Liang Yin, Linger Deng, Yabo Sun, Yuliang Liu, Xiang Bai
ICLRW 2025 ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code Xiangru Tang, Yuliang Liu, Zefan Cai, Daniel Shao, Junjie Lu, Yichi Zhang, Zexuan Deng, Helan Hu, Kaikai An, Ruijun Huang, Shuzheng Si, Chen Sheng, Haozhe Zhao, Liang Chen, Tianyu Liu, Yujia Qin, Wangchunshu Zhou, Yilun Zhao, Zhiwei Jiang, Baobao Chang, Arman Cohan, Mark Gerstein
NeurIPS 2025 MSTAR: Box-Free Multi-Query Scene Text Retrieval with Attention Recycling Liang Yin, Xudong Xie, Zhang Li, Xiang Bai, Yuliang Liu
ICLR 2025 Mini-Monkey: Alleviating the Semantic Sawtooth Effect for Lightweight MLLMs via Complementary Image Pyramid Mingxin Huang, Yuliang Liu, Dingkang Liang, Lianwen Jin, Xiang Bai
ICCV 2025 Multi-Scenario Overlapping Text Segmentation with Depth Awareness Yang Liu, Xudong Xie, Yuliang Liu, Xiang Bai
NeurIPS 2025 OCRBench V2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning Ling Fu, Zhebin Kuang, Jiajun Song, Mingxin Huang, Biao Yang, Yuzhe Li, Linghao Zhu, Qidi Luo, Xinyu Wang, Hao Lu, Zhang Li, Guozhi Tang, Bin Shan, Chunhui Lin, Qi Liu, Binghong Wu, Hao Feng, Hao Liu, Can Huang, Jingqun Tang, Wei Chen, Lianwen Jin, Yuliang Liu, Xiang Bai
CVPR 2025 SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-End Text Spotting Dongliang Luo, Hanshen Zhu, Ziyang Zhang, Dingkang Liang, Xudong Xie, Yuliang Liu, Xiang Bai
ICCV 2025 Towards Comprehensive Lecture Slides Understanding: Large-Scale Dataset and Effective Method Enming Zhang, Yuzhe Li, Yuliang Liu, Yingying Zhu, Xiang Bai
ICCV 2025 Training-Free Geometric Image Editing on Diffusion Models Hanshen Zhu, Zhen Zhu, Kaile Zhang, Yiming Gong, Yuliang Liu, Xiang Bai
NeurIPS 2024 AP-Adapter: Improving Generalization of Automatic Prompts on Unseen Text-to-Image Diffusion Models Yuchen Fu, Zhiwei Jiang, Yuliang Liu, Cong Wang, Zexuan Deng, Zhaoling Chen, Qing Gu
TMLR 2024 Accountable Textual-Visual Chat Learns to Reject Human Instructions in Image Re-Creation Zhiwei Zhang, Yuliang Liu
CVPR 2024 Bridging the Gap Between End-to-End and Two-Step Text Spotting Mingxin Huang, Hongliang Li, Yuliang Liu, Xiang Bai, Lianwen Jin
ICMLW 2024 CD-POS: Long Context Generalization in LLMs Through Continuous and Discrete Position Synthesis Zhiyuan Hu, Yuliang Liu, Jinman Zhao, Suyuchen Wang, WangYan, Wei Shen, Chao Yin, Bryan Hooi
NeurIPS 2024 MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks Xingkui Zhu, Yiran Guan, Dingkang Liang, Yuchao Chen, Yuliang Liu, Xiang Bai
CVPR 2024 Monkey: Image Resolution and Text Label Are Important Things for Large Multi-Modal Models Zhang Li, Biao Yang, Qiang Liu, Zhiyin Ma, Shuo Zhang, Jingxu Yang, Yabo Sun, Yuliang Liu, Xiang Bai
CVPR 2024 OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition Jianqiang Wan, Sibo Song, Wenwen Yu, Yuliang Liu, Wenqing Cheng, Fei Huang, Xiang Bai, Cong Yao, Zhibo Yang
AAAI 2024 ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining Dezhi Peng, Chongyu Liu, Yuliang Liu, Lianwen Jin
ICML 2024 Video-LaVIT: Unified Video-Language Pre-Training with Decoupled Visual-Motional Tokenization Yang Jin, Zhicheng Sun, Kun Xu, Kun Xu, Liwei Chen, Hao Jiang, Quzhe Huang, Chengru Song, Yuliang Liu, Di Zhang, Yang Song, Kun Gai, Yadong Mu
ECCVW 2024 Well Begun Is Half Done: The Importance of Initialization in Dataset Distillation Yiran Guan, Zhu Chen, Xingkui Zhu, Dingkang Liang, Yuliang Liu, Xiang Bai
ICCV 2023 ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer Mingxin Huang, Jiaxin Zhang, Dezhi Peng, Hao Lu, Can Huang, Yuliang Liu, Xiang Bai, Lianwen Jin
CVPR 2023 Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution Chenfan Qu, Chongyu Liu, Yuliang Liu, Xinhong Chen, Dezhi Peng, Fengjun Guo, Lianwen Jin
CVPR 2023 Turning a CLIP Model into a Scene Text Detector Wenwen Yu, Yuliang Liu, Wei Hua, Deqiang Jiang, Bo Ren, Xiang Bai
ECCV 2022 Don’t Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context Chongyu Liu, Lianwen Jin, Yuliang Liu, Canjie Luo, Bangdong Chen, Fengjun Guo, Kai Ding
NeurIPS 2022 MSDS: A Large-Scale Chinese Signature and Token Digit String Dataset for Handwriting Verification Peirong Zhang, Jiajia Jiang, Yuliang Liu, Lianwen Jin
NeurIPS 2022 SAPA: Similarity-Aware Point Affiliation for Feature Upsampling Hao Lu, Wenze Liu, Zixuan Ye, Hongtao Fu, Yuliang Liu, Zhiguo Cao
CVPR 2022 SwinTextSpotter: Scene Text Spotting via Better Synergy Between Text Detection and Text Recognition Mingxin Huang, Yuliang Liu, Zhenghao Peng, Chongyu Liu, Dahua Lin, Shenggao Zhu, Nicholas Yuan, Kai Ding, Lianwen Jin
AAAI 2019 DeRPN: Taking a Further Step Toward More General Object Detection Lele Xie, Yuliang Liu, Lianwen Jin, Zecheng Xie
AAAI 2019 EnsNet: Ensconce Text in the Wild Shuaitao Zhang, Yuliang Liu, Lianwen Jin, Yaoxiong Huang, Songxuan Lai
IJCAI 2019 Omnidirectional Scene Text Detection with Sequential-Free Box Discretization Yuliang Liu, Sheng Zhang, Lianwen Jin, Lele Xie, Yaqiang Wu, Zhepeng Wang
AAAI 2018 Feature Enhancement Network: A Refined Scene Text Detector Sheng Zhang, Yuliang Liu, Lianwen Jin, Canjie Luo
CVPR 2017 Deep Matching Prior Network: Toward Tighter Multi-Oriented Text Detection Yuliang Liu, Lianwen Jin