Yao, Cong

35 publications

AAAI 2025 ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data Yufan Shen, Chuwei Luo, Zhaoqing Zhu, Yang Chen, Qi Zheng, Zhi Yu, Jiajun Bu, Cong Yao
AAAI 2024 FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning Zhenhua Yang, Dezhi Peng, Yuxin Kong, Yuyi Zhang, Cong Yao, Lianwen Jin
CVPR 2024 LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding Chuwei Luo, Yufan Shen, Zhaoqing Zhu, Qi Zheng, Zhi Yu, Cong Yao
CVPR 2024 OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition Jianqiang Wan, Sibo Song, Wenwen Yu, Yuliang Liu, Wenqing Cheng, Fei Huang, Xiang Bai, Cong Yao, Zhibo Yang
ECCV 2024 Platypus: A Generalized Specialist Model for Reading Text in Various Forms Peng Wang, Zhaohai Li, Jun Tang, Humen Zhong, Fei Huang, Zhibo Yang, Cong Yao
ECCV 2024 Visual Text Generation in the Wild Yuanzhi Zhu, Jiawei Liu, Feiyu Gao, Wenyu Liu, Xinggang Wang, Peng Wang, Fei Huang, Cong Yao, Zhibo Yang
ECCV 2024 WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation Zirui Shao, Feiyu Gao, Hangdi Xing, Zepeng Zhu, Zhi Yu, Jiajun Bu, Qi Zheng, Cong Yao
CVPR 2023 Conditional Text Image Generation with Diffusion Models Yuanzhi Zhu, Zhaohai Li, Tianwei Wang, Mengchao He, Cong Yao
CVPR 2023 GeoLayoutLM: Geometric Pre-Training for Visual Information Extraction Chuwei Luo, Changxu Cheng, Qi Zheng, Cong Yao
ICCV 2023 LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition Changxu Cheng, Peng Wang, Cheng Da, Qi Zheng, Cong Yao
AAAI 2023 LORE: Logical Location Regression Network for Table Structure Recognition Hangdi Xing, Feiyu Gao, Rujiao Long, Jiajun Bu, Qi Zheng, Liangcheng Li, Cong Yao, Zhi Yu
CVPR 2023 Modeling Entities as Semantic Points for Visual Information Extraction in the Wild Zhibo Yang, Rujiao Long, Pengfei Wang, Sibo Song, Humen Zhong, Wenqing Cheng, Xiang Bai, Cong Yao
ICCV 2023 Vision Grid Transformer for Document Layout Analysis Cheng Da, Chuwei Luo, Qi Zheng, Cong Yao
WACV 2022 Facial Attribute Transformers for Precise and Robust Makeup Transfer Zhaoyi Wan, Haoran Chen, Jie An, Wentao Jiang, Cong Yao, Jiebo Luo
ECCV 2022 Levenshtein OCR Cheng Da, Peng Wang, Cong Yao
ECCV 2022 Multi-Granularity Prediction for Scene Text Recognition Peng Wang, Cheng Da, Cong Yao
CVPR 2022 Revisiting Document Image Dewarping by Grid Regularization Xiangwei Jiang, Rujiao Long, Nan Xue, Zhibo Yang, Cong Yao, Gui-Song Xia
CVPR 2022 Vision-Language Pre-Training for Boosting Scene Text Detectors Sibo Song, Jianqiang Wan, Zhibo Yang, Jun Tang, Wenqing Cheng, Xiang Bai, Cong Yao
CVPR 2021 MOST: A Multi-Oriented Scene Text Detector with Localization Refinement Minghang He, Minghui Liao, Zhibo Yang, Humen Zhong, Jun Tang, Wenqing Cheng, Cong Yao, Yongpan Wang, Xiang Bai
ECCV 2020 Differentiable Feature Aggregation Search for Knowledge Distillation Yushuo Guan, Pengyu Zhao, Bingxuan Wang, Yuanxing Zhang, Cong Yao, Kaigui Bian, Jian Tang
AAAI 2020 Real-Time Scene Text Detection with Differentiable Binarization Minghui Liao, Zhaoyi Wan, Cong Yao, Kai Chen, Xiang Bai
AAAI 2020 TextScanner: Reading Characters in Order for Robust Scene Text Recognition Zhaoyi Wan, Minghang He, Haoran Chen, Xiang Bai, Cong Yao
AAAI 2019 Scene Text Detection with Supervised Pyramid Context Network Enze Xie, Yuhang Zang, Shuai Shao, Gang Yu, Cong Yao, Guangyao Li
AAAI 2019 Scene Text Recognition from Two-Dimensional Perspective Minghui Liao, Jian Zhang, Zhaoyi Wan, Fengming Xie, Jiajun Liang, Pengyuan Lyu, Cong Yao, Xiang Bai
ECCV 2018 Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes Pengyuan Lyu, Minghui Liao, Cong Yao, Wenhao Wu, Xiang Bai
ECCV 2018 TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes Shangbang Long, Jiaqiang Ruan, Wenjie Zhang, Xin He, Wenhao Wu, Cong Yao
CVPR 2017 EAST: An Efficient and Accurate Scene Text Detector Xinyu Zhou, Cong Yao, He Wen, Yuzhi Wang, Shuchang Zhou, Weiran He, Jiajun Liang
CVPR 2016 Multi-Oriented Text Detection with Fully Convolutional Networks Zheng Zhang, Chengquan Zhang, Wei Shen, Cong Yao, Wenyu Liu, Xiang Bai
CVPR 2016 Robust Scene Text Recognition with Automatic Rectification Baoguang Shi, Xinggang Wang, Pengyuan Lyu, Cong Yao, Xiang Bai
ICCV 2015 Relaxed Multiple-Instance SVM with Application to Object Discovery Xinggang Wang, Zhuotun Zhu, Cong Yao, Xiang Bai
CVPR 2015 Symmetry-Based Text Line Detection in Natural Scenes Zheng Zhang, Wei Shen, Cong Yao, Xiang Bai
ECCV 2014 Human Detection Using Learned Part Alphabet and Pose Dictionary Cong Yao, Xiang Bai, Wenyu Liu, Longin Jan Latecki
CVPR 2014 Strokelets: A Learned Multi-Scale Representation for Scene Text Recognition Cong Yao, Xiang Bai, Baoguang Shi, Wenyu Liu
CVPR 2012 Detecting Texts of Arbitrary Orientations in Natural Images Cong Yao, Xiang Bai, Wenyu Liu, Yi Ma, Zhuowen Tu
CVPRW 2012 Randomness and Sparsity Induced Codebook Learning with Application to Cancer Image Classification Quannan Li, Cong Yao, Liwei Wang, Zhuowen Tu