Zhang, Shikun
32 publications
NeurIPS
2025
Boosting Resilience of Large Language Models Through Causality-Driven Robust Optimization
NeurIPS
2025
VLM-R³: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought
NeurIPS
2024
MaVEn: An Effective Multi-Granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model
ICCV
2023
BUS: Efficient and Effective Vision-Language Pre-Training with Bottom-up Patch Summarization.