SAIL: Sample-Centric In-Context Learning for Document Information Extraction

Authors

  • Jinyu Zhang Shanghai Jiao Tong University
  • Zhiyuan You The Chinese University of Hong Kong
  • Jize Wang Shanghai Jiao Tong University
  • Xinyi Le Shanghai Jiao Tong University

DOI:

https://doi.org/10.1609/aaai.v39i24.34780

Abstract

Document Information Extraction (DIE) aims to extract structured information from Visually Rich Documents (VRDs). Previous full-training approaches have demonstrated strong performance but may struggle with generalization to unseen data. In contrast, training-free methods leverage powerful pre-trained models like Large Language Models (LLMs) to address various downstream tasks with only a few examples. Nonetheless, training-free methods for DIE encounter two primary challenges: (1) understanding the complex relationship between layout and textual elements in VRDs, and (2) providing accurate guidance to pre-trained models. To address these challenges, we propose SAmple-centric In-context Learning (SAIL). SAIL introduces a fine-grained entity-level textual similarity to facilitate in-depth text analysis by LLMs and incorporates layout similarity to enhance the analysis of layouts in VRDs. Moreover, SAIL formulates a unified In-Context Learning (ICL) prompt template for various sample-centric examples, enabling tailored prompts that deliver precise guidance to pre-trained models for each sample. Extensive experiments on FUNSD, CORD, and SROIE benchmarks with various base models (e.g., LLMs) indicate that our SAIL outperforms training-free baselines, even closer to the full-training methods, showing the superiority and generalization of our method.

Published

2025-04-11

How to Cite

Zhang, J., You, Z., Wang, J., & Le, X. (2025). SAIL: Sample-Centric In-Context Learning for Document Information Extraction. Proceedings of the AAAI Conference on Artificial Intelligence, 39(24), 25868–25876. https://doi.org/10.1609/aaai.v39i24.34780

Issue

Section

AAAI Technical Track on Natural Language Processing III