FinRpt: Dataset, Evaluation System and LLM-based Multi-agent Framework for Equity Research Report Generation

Authors

  • Song Jin Renmin University of China
  • Shuqi Li Renmin University of China King Abdullah University of Science and Technology
  • Shukun Zhang Renmin University of China
  • Rui Yan Renmin University of China Wuhan University Wuhan Textile University

DOI:

https://doi.org/10.1609/aaai.v40i1.37014

Abstract

While LLMs have shown great success in financial tasks like stock prediction and question answering, their application in fully automating Equity Research Report generation remains uncharted territory. In this paper, we formulate the Equity Research Report (ERR) Generation task for the first time. To address the data scarcity and the evaluation metrics absence, we present an open-source evaluation benchmark for ERR generation - FinRpt. We frame a Dataset Construction Pipeline that integrates 7 financial data types and produces a high-quality ERR dataset automatically, which could be used for model training and evaluation. We also introduce a comprehensive evaluation system including 11 metrics to assess the generated ERRs. Moreover, we propose a multi-agent framework specifically tailored to address this task, named FinRpt-Gen, and train several LLM-based agents on the proposed datasets using Supervised Fine-Tuning and Reinforcement Learning. Experimental results indicate the data quality and metrics effectiveness of the benchmark FinRpt and the strong performance of FinRpt-Gen, showcasing their potential to drive innovation in the ERR generation field. All code and datasets are publicly available.

Published

2026-03-14

How to Cite

Jin, S., Li, S., Zhang, S., & Yan, R. (2026). FinRpt: Dataset, Evaluation System and LLM-based Multi-agent Framework for Equity Research Report Generation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(1), 507–515. https://doi.org/10.1609/aaai.v40i1.37014

Issue

Section

AAAI Technical Track on Application Domains I