ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models Inference

Cite

Text

Zeng et al. "ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models Inference." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I17.29922

Markdown

[Zeng et al. "ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models Inference." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/zeng2024aaai-consistentee/) doi:10.1609/AAAI.V38I17.29922

BibTeX

@inproceedings{zeng2024aaai-consistentee,
  title     = {{ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models Inference}},
  author    = {Zeng, Ziqian and Hong, Yihuai and Dai, Hongliang and Zhuang, Huiping and Chen, Cen},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {19506-19514},
  doi       = {10.1609/AAAI.V38I17.29922},
  url       = {https://mlanthology.org/aaai/2024/zeng2024aaai-consistentee/}
}