On the Calibration and Uncertainty with Pólya-Gamma Augmentation for Dialog Retrieval Models

Authors

  • Tong Ye Ping An Technology (Shenzhen) Co., Ltd. University of Science and Technology of China
  • Shijing Si Ping An Technology (Shenzhen) Co., Ltd.
  • Jianzong Wang Ping An Technology (Shenzhen) Co., Ltd.
  • Ning Cheng Ping An Technology (Shenzhen) Co., Ltd.
  • Zhitao Li Ping An Technology (Shenzhen) Co., Ltd.
  • Jing Xiao Ping An Technology (Shenzhen) Co., Ltd.

DOI:

https://doi.org/10.1609/aaai.v37i11.26630

Keywords:

SNLP: Conversational AI/Dialogue Systems, ML: Calibration & Uncertainty Quantification, RU: Applications, SNLP: Question Answering

Abstract

Deep neural retrieval models have amply demonstrated their power but estimating the reliability of their predictions remains challenging. Most dialog response retrieval models output a single score for a response on how relevant it is to a given question. However, the bad calibration of deep neural network results in various uncertainty for the single score such that the unreliable predictions always misinform user decisions. To investigate these issues, we present an efficient calibration and uncertainty estimation framework PG-DRR for dialog response retrieval models which adds a Gaussian Process layer to a deterministic deep neural network and recovers conjugacy for tractable posterior inference by Pólya-Gamma augmentation. Finally, PG-DRR achieves the lowest empirical calibration error (ECE) in the in-domain datasets and the distributional shift task while keeping R10@1 and MAP performance.

Downloads

Published

2023-06-26

How to Cite

Ye, T., Si, S., Wang, J., Cheng, N., Li, Z., & Xiao, J. (2023). On the Calibration and Uncertainty with Pólya-Gamma Augmentation for Dialog Retrieval Models. Proceedings of the AAAI Conference on Artificial Intelligence, 37(11), 13923-13931. https://doi.org/10.1609/aaai.v37i11.26630

Issue

Section

AAAI Technical Track on Speech & Natural Language Processing