Self-Correction Distillation for Structured Data Question Answering

Yushan Zhu; Wen Zhang; Long Jin; Mengshu Sun; Ling Zhong; Zhiqiang Liu; Juan Li; Lei Liang; Chong Long; Chao Deng; Junlan Feng

doi:10.1609/aaai.v40i19.38697

Authors

Yushan Zhu Zhejiang University JIUTIAN Research
Wen Zhang Zhejiang University
Long Jin Zhejiang University
Mengshu Sun Ant Group
Ling Zhong Ant Group
Zhiqiang Liu Zhejiang University
Juan Li Zhejiang University
Lei Liang Ant Group
Chong Long JIUTIAN Research
Chao Deng JIUTIAN Research
Junlan Feng JIUTIAN Research

DOI:

https://doi.org/10.1609/aaai.v40i19.38697

Abstract

Structured data question answering (QA), including table QA, Knowledge Graph (KG) QA, and temporal KG QA, is a pivotal research area. Advances in large language models (LLMs) have driven significant progress in unified structural QA frameworks like TrustUQA. However, these frameworks face challenges when applied to small-scale LLMs since small-scale LLMs are prone to errors in generating structured queries. To improve the structured data QA ability of small-scale LLMs, we propose a self-correction distillation (SCD) method. In SCD, an error prompt mechanism (EPM) is designed to detect errors and provide customized error messages during inference, and a two-stage distillation strategy is designed to transfer large-scale LLMs' query-generation and error-correction capabilities to small-scale LLM. Experiments across 5 benchmarks with 3 structured data types demonstrate that our SCD achieves the best performance and superior generalization on small-scale LLM (8B) compared to other distillation methods, and closely approaches the performance of GPT4 on some datasets. Furthermore, large-scale LLMs equipped with EPM surpass the state-of-the-art results on most datasets.

Self-Correction Distillation for Structured Data Question Answering

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information