MedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models

Cite

Text

Cai et al. "MedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I16.29723

Markdown

[Cai et al. "MedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/cai2024aaai-medbench/) doi:10.1609/AAAI.V38I16.29723

BibTeX

@inproceedings{cai2024aaai-medbench,
  title     = {{MedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models}},
  author    = {Cai, Yan and Wang, Linlin and Wang, Ye and de Melo, Gerard and Zhang, Ya and Wang, Yanfeng and He, Liang},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {17709-17717},
  doi       = {10.1609/AAAI.V38I16.29723},
  url       = {https://mlanthology.org/aaai/2024/cai2024aaai-medbench/}
}