Retrieving Versus Understanding Extractive Evidence in Few-Shot Learning

Authors

  • Karl Elbakian University of New Hampshire, Durham, NH
  • Samuel Carton University of New Hampshire, Durham, NH

DOI:

https://doi.org/10.1609/aaai.v39i26.34936

Abstract

A key aspect of alignment is the proper use of within-document evidence to construct document-level decisions. We analyze the relationship between the retrieval and interpretation of within-document evidence for large language model in a few-shot setting. Specifically, we measure the extent to which model prediction errors are associated with evidence retrieval errors with respect to gold-standard human-annotated extractive evidence for five datasets, using two popular closed proprietary models. We perform two ablation studies to investigate when both label prediction and evidence retrieval errors can be attributed to qualities of the relevant evidence. We find that there is a strong empirical relationship between model prediction and evidence retrieval error, but that evidence retrieval error is mostly not associated with evidence interpretation error--a hopeful sign for downstream applications built on this mechanism.

Downloads

Published

2025-04-11

How to Cite

Elbakian, K., & Carton, S. (2025). Retrieving Versus Understanding Extractive Evidence in Few-Shot Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 39(26), 27268–27276. https://doi.org/10.1609/aaai.v39i26.34936

Issue

Section

AAAI Technical Track on AI Alignment