Empowering and Assessing the Utility of Large Language Models in Crop Science
Abstract
Large language models (LLMs) have demonstrated remarkable efficacy across knowledge-intensive tasks. Nevertheless, their untapped potential in crop science presents an opportunity for advancement. To narrow this gap, we introduce CROP, which includes a novel instruction tuning dataset specifically designed to enhance LLMs’ professional capabilities in the crop science sector, along with a benchmark that serves as a comprehensive evaluation of LLMs’ understanding of the domain knowledge. The CROP dataset is curated through a task-oriented and LLM-human integrated pipeline, comprising 210,038 single-turn and 1,871 multi-turn dialogues related to crop science scenarios. The CROP benchmark includes 5,045 multiple-choice questions covering three difficulty levels. Our experiments based on the CROP benchmark demonstrate notable enhancements in crop science-related tasks when LLMs are fine-tuned with the CROP dataset. To the best of our knowledge, CROP dataset is the first-ever instruction tuning dataset in the crop science domain. We anticipate that CROP will accelerate the adoption of LLMs in the domain of crop science, ultimately contributing to global food production.
Cite
Text
Zhang et al. "Empowering and Assessing the Utility of Large Language Models in Crop Science." Neural Information Processing Systems, 2024. doi:10.52202/079017-1669Markdown
[Zhang et al. "Empowering and Assessing the Utility of Large Language Models in Crop Science." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/zhang2024neurips-empowering/) doi:10.52202/079017-1669BibTeX
@inproceedings{zhang2024neurips-empowering,
title = {{Empowering and Assessing the Utility of Large Language Models in Crop Science}},
author = {Zhang, Hang and Sun, Jiawei and Chen, Renqi and Liu, Wei and Yuan, Zhonghang and Zheng, Xinzhe and Wang, Zhefan and Yang, Zhiyuan and Yan, Hang and Zhong, Hansen and Wang, Xiqing and Ouyang, Wanli and Yang, Fan and Dong, Nanqing},
booktitle = {Neural Information Processing Systems},
year = {2024},
doi = {10.52202/079017-1669},
url = {https://mlanthology.org/neurips/2024/zhang2024neurips-empowering/}
}