Diffuser: Efficient Transformers with Multi-Hop Attention Diffusion for Long Sequences

Cite

Text

Feng et al. "Diffuser: Efficient Transformers with Multi-Hop Attention Diffusion for Long Sequences." AAAI Conference on Artificial Intelligence, 2023. doi:10.1609/AAAI.V37I11.26502

Markdown

[Feng et al. "Diffuser: Efficient Transformers with Multi-Hop Attention Diffusion for Long Sequences." AAAI Conference on Artificial Intelligence, 2023.](https://mlanthology.org/aaai/2023/feng2023aaai-diffuser/) doi:10.1609/AAAI.V37I11.26502

BibTeX

@inproceedings{feng2023aaai-diffuser,
  title     = {{Diffuser: Efficient Transformers with Multi-Hop Attention Diffusion for Long Sequences}},
  author    = {Feng, Aosong and Li, Irene and Jiang, Yuang and Ying, Rex},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {12772-12780},
  doi       = {10.1609/AAAI.V37I11.26502},
  url       = {https://mlanthology.org/aaai/2023/feng2023aaai-diffuser/}
}