Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference

Cite

Text

Lin et al. "Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I5.32567

Markdown

[Lin et al. "Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/lin2025aaai-boosting/) doi:10.1609/AAAI.V39I5.32567

BibTeX

@inproceedings{lin2025aaai-boosting,
  title     = {{Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference}},
  author    = {Lin, Zhihang and Lin, Mingbao and Lin, Luxi and Ji, Rongrong},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {5334-5342},
  doi       = {10.1609/AAAI.V39I5.32567},
  url       = {https://mlanthology.org/aaai/2025/lin2025aaai-boosting/}
}