Window Token Concatenation for Efficient Visual Large Language Models

Cite

Text

Li et al. "Window Token Concatenation for Efficient Visual Large Language Models." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.

Markdown

[Li et al. "Window Token Concatenation for Efficient Visual Large Language Models." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.](https://mlanthology.org/cvprw/2025/li2025cvprw-window/)

BibTeX

@inproceedings{li2025cvprw-window,
  title     = {{Window Token Concatenation for Efficient Visual Large Language Models}},
  author    = {Li, Yifan and Bao, Wentao and Ye, Botao and Tan, Zhen and Chen, Tianlong and Liu, Huan and Kong, Yu},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2025},
  pages     = {3187-3197},
  url       = {https://mlanthology.org/cvprw/2025/li2025cvprw-window/}
}