Zeng, Gangyan

4 publications

CVPR 2025. CLIP Is Almost All You Need: Towards Parameter-Efficient Scene Text Retrieval Without OCR. Xugong Qin, Peng Zhang, Jun Jie Ou Yang, Gangyan Zeng, Yubo Li, Yuanyuan Wang, Wanqian Zhang, Pengwen Dai
CVPR 2025. Towards Natural Language-Based Document Image Retrieval: New Dataset and Benchmark. Hao Guo, Xugong Qin, Jun Jie Ou Yang, Peng Zhang, Gangyan Zeng, Yubo Li, Hailun Lin
AAAI 2025. Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues. Yan Zhang, Gangyan Zeng, Huawen Shen, Daiqing Wu, Yu Zhou, Can Ma
NeurIPS 2025. When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding. Yan Shu, Hangui Lin, Yexin Liu, Yan Zhang, Gangyan Zeng, Yan Li, Yu Zhou, Ser-Nam Lim, Harry Yang, Nicu Sebe