Liu, Cheng-Lin
43 publications
CVPR
2025
DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning
AAAI
2025
Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information
43 publications