Huang, Mingxin

5 publications

ICLR 2025 Mini-Monkey: Alleviating the Semantic Sawtooth Effect for Lightweight MLLMs via Complementary Image Pyramid Mingxin Huang, Yuliang Liu, Dingkang Liang, Lianwen Jin, Xiang Bai
NeurIPS 2025 OCRBench V2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning Ling Fu, Zhebin Kuang, Jiajun Song, Mingxin Huang, Biao Yang, Yuzhe Li, Linghao Zhu, Qidi Luo, Xinyu Wang, Hao Lu, Zhang Li, Guozhi Tang, Bin Shan, Chunhui Lin, Qi Liu, Binghong Wu, Hao Feng, Hao Liu, Can Huang, Jingqun Tang, Wei Chen, Lianwen Jin, Yuliang Liu, Xiang Bai
CVPR 2024 Bridging the Gap Between End-to-End and Two-Step Text Spotting Mingxin Huang, Hongliang Li, Yuliang Liu, Xiang Bai, Lianwen Jin
ICCV 2023 ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer Mingxin Huang, Jiaxin Zhang, Dezhi Peng, Hao Lu, Can Huang, Yuliang Liu, Xiang Bai, Lianwen Jin
CVPR 2022 SwinTextSpotter: Scene Text Spotting via Better Synergy Between Text Detection and Text Recognition Mingxin Huang, Yuliang Liu, Zhenghao Peng, Chongyu Liu, Dahua Lin, Shenggao Zhu, Nicholas Yuan, Kai Ding, Lianwen Jin