Maximizing the Position Embedding for Vision Transformers with Global Average Pooling

Cite

Text

Lee et al. "Maximizing the Position Embedding for Vision Transformers with Global Average Pooling." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I17.33997

Markdown

[Lee et al. "Maximizing the Position Embedding for Vision Transformers with Global Average Pooling." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/lee2025aaai-maximizing/) doi:10.1609/AAAI.V39I17.33997

BibTeX

@inproceedings{lee2025aaai-maximizing,
  title     = {{Maximizing the Position Embedding for Vision Transformers with Global Average Pooling}},
  author    = {Lee, Wonjun and Ham, Bumsub and Kim, Suhyun},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {18154--18162},
  doi       = {10.1609/AAAI.V39I17.33997},
  url       = {https://mlanthology.org/aaai/2025/lee2025aaai-maximizing/}
}