Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study

Authors

  • Yuqi Zhu Zhejiang University Zhejiang University - Ant Group Joint Laboratory of Knowledge Graph
  • Yi Zhong Zhejiang University
  • Jintian Zhang Zhejiang University Zhejiang University - Ant Group Joint Laboratory of Knowledge Graph
  • Ziheng Zhang Independent Researcher
  • Shuofei Qiao Zhejiang University Zhejiang University - Ant Group Joint Laboratory of Knowledge Graph
  • Yujie Luo Zhejiang University Zhejiang University - Ant Group Joint Laboratory of Knowledge Graph
  • Lun Du Ant Group Zhejiang University - Ant Group Joint Laboratory of Knowledge Graph
  • Da Zheng Ant Group Zhejiang University - Ant Group Joint Laboratory of Knowledge Graph
  • Ningyu Zhang Zhejiang University Zhejiang University - Ant Group Joint Laboratory of Knowledge Graph
  • Huajun Chen Zhejiang University Zhejiang University - Ant Group Joint Laboratory of Knowledge Graph

DOI:

https://doi.org/10.1609/aaai.v40i41.40831

Abstract

Large Language Models (LLMs) hold promise in automating data analysis tasks, yet open-source models face significant limitations in these kinds of reasoning-intensive scenarios. In this work, we investigate strategies to enhance the data analysis capabilities of open-source LLMs. By curating a seed dataset of diverse, realistic scenarios, we evaluate models across three dimensions: data understanding, code generation, and strategic planning. Our analysis reveals three key findings: (1) Strategic planning quality serves as the primary determinant of model performance; (2) Interaction design and task complexity significantly influence reasoning capabilities; (3) Data quality demonstrates a greater impact than diversity in achieving optimal performance. We leverage these insights to develop a data synthesis methodology, demonstrating significant improvements in open-source LLMs' analytical reasoning capabilities.

Published

2026-03-14

How to Cite

Zhu, Y., Zhong, Y., Zhang, J., Zhang, Z., Qiao, S., Luo, Y., … Chen, H. (2026). Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study. Proceedings of the AAAI Conference on Artificial Intelligence, 40(41), 35239–35247. https://doi.org/10.1609/aaai.v40i41.40831

Issue

Section

AAAI Technical Track on Natural Language Processing VI