DomainCQA: Crafting Knowledge-Intensive QA from Domain-Specific Charts

Authors

  • Yujing Lu Zhejiang Lab
  • Ling Zhong Zhejiang Lab
  • Jing Yang Zhejiang Lab
  • Weiming Li Zhejiang Lab
  • Peng Wei National Astronomical Observatories, Chinese Academy of Sciences
  • Yongheng Wang Zhejiang lab
  • Manni Duan Zhejiang Lab
  • Qing Zhang Zhejiang Lab

DOI:

https://doi.org/10.1609/aaai.v40i38.40509

Abstract

Chart Question Answering (CQA) evaluates Multimodal Large Language Models (MLLMs) on visual understanding and reasoning over chart data. However, existing benchmarks mostly test surface-level parsing, such as reading labels and legends, while overlooking deeper scientific reasoning. We propose DomainCQA, a framework for constructing domain-specific CQA benchmarks that emphasize both visual comprehension and knowledge-intensive reasoning. It integrates complexity-aware chart selection, multitier QA generation, and expert validation. Applied to astronomy, DomainCQA yields AstroChart, a benchmark of 1,690 QA pairs over 482 charts, exposing persistent weaknesses in fine-grained perception, numerical reasoning, and domain knowledge integration across 21 MLLMs. Fine-tuning on AstroChart improves performance across fundamental and advanced tasks. Pilot QA sets in biochemistry, economics, medicine, and social science further demonstrate DomainCQA’s generality. Together, our results establish DomainCQA as a unified pipeline for constructing and augmenting domain-specific chart reasoning benchmarks.

Published

2026-03-14

How to Cite

Lu, Y., Zhong, L., Yang, J., Li, W., Wei, P., Wang, Y., … Zhang, Q. (2026). DomainCQA: Crafting Knowledge-Intensive QA from Domain-Specific Charts. Proceedings of the AAAI Conference on Artificial Intelligence, 40(38), 32347–32355. https://doi.org/10.1609/aaai.v40i38.40509

Issue

Section

AAAI Technical Track on Natural Language Processing III