A Causal Target for Learning to Defer Under Hidden Confounding

Authors

  • Yanmin Li Laboratory for Big Data and Decision, National University of Defense Technology
  • Lihua Liu Laboratory for Big Data and Decision, National University of Defense Technology
  • Xin Wang University of Science and Technology of China
  • Zhilong Mao Laboratory for Big Data and Decision, National University of Defense Technology
  • Jibing Wu Laboratory for Big Data and Decision, National University of Defense Technology
  • Weidong Bao Laboratory for Big Data and Decision, National University of Defense Technology

DOI:

https://doi.org/10.1609/aaai.v40i28.39493

Abstract

Learning decision policies from confounded observational data is a challenging task in causal inference, as unobserved confounders can lead to biased or suboptimal actions when relying solely on machine learning models. A synergistic approach is learning to defer, which decides when to act itself and when to defer to a human expert with access to unobserved information. However, constructing the learning target, which defines the probability of choosing each action or deferral, remains a core challenge. To address this, we propose causal-target-based learning to defer (CTLD) framework, where the causal target is constructed from sharp bounds on potential outcomes. Specifically, the degree of overlap between these bounds determines the probability of deferral, while their relative positions and widths define the probabilities over actions. CTLD aligns model predictions with this causal target to make probabilistic decisions over actions and deferral. We present comprehensive theoretical guarantees for the learned policy and demonstrate the effectiveness of CTLD on synthetic and semi-synthetic datasets.

Published

2026-03-14

How to Cite

Li, Y., Liu, L., Wang, X., Mao, Z., Wu, J., & Bao, W. (2026). A Causal Target for Learning to Defer Under Hidden Confounding. Proceedings of the AAAI Conference on Artificial Intelligence, 40(28), 23248–23255. https://doi.org/10.1609/aaai.v40i28.39493

Issue

Section

AAAI Technical Track on Machine Learning V