EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-Commerce
Abstract
Recently, instruction-following Large Language Models (LLMs) , represented by ChatGPT, have exhibited exceptional performance in general Natural Language Processing (NLP) tasks. However, the unique characteristics of E-commerce data pose significant challenges to general LLMs. An LLM tailored specifically for E-commerce scenarios, possessing robust cross-dataset/task generalization capabilities, is a pressing necessity. To solve this issue, in this work, we proposed the first E-commerce instruction dataset EcomInstruct, with a total of 2.5 million instruction data. EcomInstruct scales up the data size and task diversity by constructing atomic tasks with E-commerce basic data types, such as product information, user reviews. Atomic tasks are defined as intermediate tasks implicitly involved in solving a final task, which we also call Chain-of-Task tasks. We developed EcomGPT with different parameter scales by training the backbone model BLOOMZ with the EcomInstruct. Benefiting from the fundamental semantic understanding capabilities acquired from the Chain-of-Task tasks, EcomGPT exhibits excellent zero-shot generalization capabilities. Extensive experiments and human evaluations demonstrate that EcomGPT outperforms ChatGPT in term of cross-dataset/task generalization on E-commerce tasks. The EcomGPT will be public at https://github.com/Alibaba-NLP/EcomGPT.
Cite
Text
Li et al. "EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-Commerce." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I17.29820Markdown
[Li et al. "EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-Commerce." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/li2024aaai-ecomgpt/) doi:10.1609/AAAI.V38I17.29820BibTeX
@inproceedings{li2024aaai-ecomgpt,
title = {{EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-Commerce}},
author = {Li, Yangning and Ma, Shirong and Wang, Xiaobin and Huang, Shen and Jiang, Chengyue and Zheng, Haitao and Xie, Pengjun and Huang, Fei and Jiang, Yong},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2024},
pages = {18582-18590},
doi = {10.1609/AAAI.V38I17.29820},
url = {https://mlanthology.org/aaai/2024/li2024aaai-ecomgpt/}
}