Input Compression with Positional Consistency for Efficient Training and Inference of Transformer Neural Networks

Cite

Text

Nagarajan and Raghunathan. "Input Compression with Positional Consistency for Efficient Training and Inference of Transformer Neural Networks." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024. doi:10.1007/978-3-031-70362-1_5

Markdown

[Nagarajan and Raghunathan. "Input Compression with Positional Consistency for Efficient Training and Inference of Transformer Neural Networks." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024.](https://mlanthology.org/ecmlpkdd/2024/nagarajan2024ecmlpkdd-input/) doi:10.1007/978-3-031-70362-1_5

BibTeX

@inproceedings{nagarajan2024ecmlpkdd-input,
  title     = {{Input Compression with Positional Consistency for Efficient Training and Inference of Transformer Neural Networks}},
  author    = {Nagarajan, Amrit and Raghunathan, Anand},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2024},
  pages     = {73-88},
  doi       = {10.1007/978-3-031-70362-1_5},
  url       = {https://mlanthology.org/ecmlpkdd/2024/nagarajan2024ecmlpkdd-input/}
}