Wu, Qiong
17 publications
NeurIPS
2025
Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings
NeurIPS
2025
Diagnosing and Addressing Pitfalls in KG-RAG Datasets: Toward More Reliable Benchmarking
AAAI
2025
Fit and Prune: Fast and Training-Free Visual Token Pruning for Multi-Modal Large Language Models
ICLR
2025
Routing Experts: Learning to Route Dynamic Experts in Existing Multi-Modal Large Language Models