A Comprehensive Analysis of the Effectiveness of Large Language Models as Automatic Dialogue Evaluators

Cite

Text

Zhang et al. "A Comprehensive Analysis of the Effectiveness of Large Language Models as Automatic Dialogue Evaluators." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I17.29923

Markdown

[Zhang et al. "A Comprehensive Analysis of the Effectiveness of Large Language Models as Automatic Dialogue Evaluators." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/zhang2024aaai-comprehensive/) doi:10.1609/AAAI.V38I17.29923

BibTeX

@inproceedings{zhang2024aaai-comprehensive,
  title     = {{A Comprehensive Analysis of the Effectiveness of Large Language Models as Automatic Dialogue Evaluators}},
  author    = {Zhang, Chen and D'Haro, Luis Fernando and Chen, Yiming and Zhang, Malu and Li, Haizhou},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {19515-19524},
  doi       = {10.1609/AAAI.V38I17.29923},
  url       = {https://mlanthology.org/aaai/2024/zhang2024aaai-comprehensive/}
}