Xie, Hongtao
53 publications
NeurIPS
2025
CAPability: A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness
ICCV
2025
Forensic-MoE: Exploring Comprehensive Synthetic Image Detection Traces with Mixture of Experts
CVPR
2025
Hybrid-Level Instruction Injection for Video Token Compression in Multi-Modal Large Language Models
ICCV
2025
Invisible Watermarks, Visible Gains: Steering Machine Unlearning with Bi-Level Watermarking Design
IJCAI
2025
IterMeme: Expert-Guided Multimodal LLM for Interactive Meme Creation with Layout-Aware Generation
IJCAI
2024
Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition
IJCAI
2024
Self-Supervised Pre-Training with Symmetric Superimposition Modeling for Scene Text Recognition
IJCAI
2023
Linguistic More: Taking a Further Step Toward Efficient and Accurate Scene Text Recognition
NeurIPS
2022
Bridging the Gap Between Vision Transformers and Convolutional Neural Networks on Small Datasets