Conformal Autoregressive Generation: Beam Search with Coverage Guarantees

Authors

  • Nicolas Deutschmann IBM Research
  • Marvin Alberts IBM Research Zurich
  • María Rodríguez Martínez IBM Research

DOI:

https://doi.org/10.1609/aaai.v38i10.29062

Keywords:

ML: Calibration & Uncertainty Quantification, APP: Natural Sciences, NLP: Generation

Abstract

We introduce two new extensions to the beam search algorithm based on conformal predictions (CP) to produce sets of sequences with theoretical coverage guarantees. The first method is very simple and proposes dynamically-sized subsets of beam search results but, unlike typical CP proceedures, has an upper bound on the achievable guarantee depending on a post-hoc calibration measure. Our second algorithm introduces the conformal set prediction procedure as part of the decoding process, producing a variable beam width which adapts to the current uncertainty. While more complex, this procedure can achieve coverage guarantees selected a priori. We provide marginal coverage bounds as well as calibration-conditional guarantees for each method, and evaluate them empirically on a selection of tasks drawing from natural language processing and chemistry.

Downloads

Published

2024-03-24

How to Cite

Deutschmann, N., Alberts, M., & Rodríguez Martínez, M. (2024). Conformal Autoregressive Generation: Beam Search with Coverage Guarantees. Proceedings of the AAAI Conference on Artificial Intelligence, 38(10), 11775-11783. https://doi.org/10.1609/aaai.v38i10.29062

Issue

Section

AAAI Technical Track on Machine Learning I