EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-commerce

Yangning Li; Shirong Ma; Xiaobin Wang; Shen Huang; Chengyue Jiang; Hai-Tao Zheng; Pengjun Xie; Fei Huang; Yong Jiang

doi:10.1609/aaai.v38i17.29820

Authors

Yangning Li SIGS, Tsinghua University PengCheng Laboratory
Shirong Ma SIGS, Tsinghua University
Xiaobin Wang DAMO Academy, Alibaba Group
Shen Huang DAMO Academy, Alibaba Group
Chengyue Jiang Shanghaitech University
Hai-Tao Zheng SIGS, Tsinghua University PengCheng Laboratory
Pengjun Xie DAMO Academy, Alibaba Group
Fei Huang DAMO Academy, Alibaba Group
Yong Jiang DAMO Academy, Alibaba Group

DOI:

https://doi.org/10.1609/aaai.v38i17.29820

Keywords:

NLP: (Large) Language Models, NLP: Applications, NLP: Information Extraction, NLP: Other

Abstract

Recently, instruction-following Large Language Models (LLMs) , represented by ChatGPT, have exhibited exceptional performance in general Natural Language Processing (NLP) tasks. However, the unique characteristics of E-commerce data pose significant challenges to general LLMs. An LLM tailored specifically for E-commerce scenarios, possessing robust cross-dataset/task generalization capabilities, is a pressing necessity. To solve this issue, in this work, we proposed the first E-commerce instruction dataset EcomInstruct, with a total of 2.5 million instruction data. EcomInstruct scales up the data size and task diversity by constructing atomic tasks with E-commerce basic data types, such as product information, user reviews. Atomic tasks are defined as intermediate tasks implicitly involved in solving a final task, which we also call Chain-of-Task tasks. We developed EcomGPT with different parameter scales by training the backbone model BLOOMZ with the EcomInstruct. Benefiting from the fundamental semantic understanding capabilities acquired from the Chain-of-Task tasks, EcomGPT exhibits excellent zero-shot generalization capabilities. Extensive experiments and human evaluations demonstrate that EcomGPT outperforms ChatGPT in term of cross-dataset/task generalization on E-commerce tasks. The EcomGPT will be public at https://github.com/Alibaba-NLP/EcomGPT.

EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-commerce

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information