EXCGEC: A Benchmark for Edit-Wise Explainable Chinese Grammatical Error Correction

Authors

  • Jingheng Ye Tsinghua University
  • Shang Qin Tsinghua University
  • Yinghui Li Tsinghua University
  • Xuxin Cheng Peking University
  • Libo Qin Central South University
  • Hai-Tao Zheng Tsinghua University
  • Ying Shen Sun Yat-Sen University
  • Peng Xing Tsinghua University
  • Zishan Xu Tsinghua University
  • Guo Cheng Tsinghua University
  • Wenhao Jiang Guangming Laboratory of Artificial Intelligence and Digital Economy (SZ)

DOI:

https://doi.org/10.1609/aaai.v39i24.34759

Abstract

Existing studies explore the explainability of Grammatical Error Correction (GEC) in a limited scenario, where they ignore the interaction between corrections and explanations and have not established a corresponding comprehensive benchmark. To bridge the gap, this paper first introduces the task of EXplainable GEC (EXGEC), which focuses on the integral role of correction and explanation tasks. To facilitate the task, we propose EXCGEC, a tailored benchmark for Chinese EXGEC consisting of 8,216 explanation-augmented samples featuring the design of hybrid edit-wise explanations. We then benchmark several series of LLMs in multi-task learning settings, including post-explaining and pre-explaining. To promote the development of the task, we also build a comprehensive evaluation suite by leveraging existing automatic metrics and conducting human evaluation experiments to demonstrate the human consistency of the automatic metrics for free-text explanations. Our experiments reveal the effectiveness of evaluating free-text explanations using traditional metrics like METEOR and ROUGE, and the inferior performance of multi-task models compared to the pipeline solution, indicating its challenges to establish positive effects in learning both tasks.

Published

2025-04-11

How to Cite

Ye, J., Qin, S., Li, Y., Cheng, X., Qin, L., Zheng, H.-T., Shen, Y., Xing, P., Xu, Z., Cheng, G., & Jiang, W. (2025). EXCGEC: A Benchmark for Edit-Wise Explainable Chinese Grammatical Error Correction. Proceedings of the AAAI Conference on Artificial Intelligence, 39(24), 25678-25686. https://doi.org/10.1609/aaai.v39i24.34759

Issue

Section

AAAI Technical Track on Natural Language Processing III