A Causal Target for Learning to Defer Under Hidden Confounding

Yanmin Li; Lihua Liu; Xin Wang; Zhilong Mao; Jibing Wu; Weidong Bao

doi:10.1609/aaai.v40i28.39493

Authors

Yanmin Li Laboratory for Big Data and Decision, National University of Defense Technology
Lihua Liu Laboratory for Big Data and Decision, National University of Defense Technology
Xin Wang University of Science and Technology of China
Zhilong Mao Laboratory for Big Data and Decision, National University of Defense Technology
Jibing Wu Laboratory for Big Data and Decision, National University of Defense Technology
Weidong Bao Laboratory for Big Data and Decision, National University of Defense Technology

DOI:

https://doi.org/10.1609/aaai.v40i28.39493

Abstract

Learning decision policies from confounded observational data is a challenging task in causal inference, as unobserved confounders can lead to biased or suboptimal actions when relying solely on machine learning models. A synergistic approach is learning to defer, which decides when to act itself and when to defer to a human expert with access to unobserved information. However, constructing the learning target, which defines the probability of choosing each action or deferral, remains a core challenge. To address this, we propose causal-target-based learning to defer (CTLD) framework, where the causal target is constructed from sharp bounds on potential outcomes. Specifically, the degree of overlap between these bounds determines the probability of deferral, while their relative positions and widths define the probabilities over actions. CTLD aligns model predictions with this causal target to make probabilistic decisions over actions and deferral. We present comprehensive theoretical guarantees for the learned policy and demonstrate the effectiveness of CTLD on synthetic and semi-synthetic datasets.

A Causal Target for Learning to Defer Under Hidden Confounding

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information