Nuance Matters: Probing Epistemic Consistency in Causal Reasoning

Authors

  • Shaobo Cui EPFL, Switzerland
  • Junyou Li University of Waterloo, Canada
  • Luca Mouchel EPFL, Switzerland
  • Yiyang Feng EPFL, Switzerland
  • Boi Faltings EPFL, Switzerland

DOI:

https://doi.org/10.1609/aaai.v39i22.34542

Abstract

Previous research on causal reasoning often overlooks the subtleties crucial to understanding causal reasoning. To address this gap, our study introduces the concept of causal epistemic consistency, which focuses on the self-consistency of Large Language Models (LLMs) in differentiating intermediates with nuanced differences in causal reasoning. We propose a suite of novel metrics -- intensity ranking concordance, cross-group position agreement, and intra-group clustering -- to evaluate LLMs on this front. Through extensive empirical studies on 21 high-profile LLMs, including GPT-4, Claude3, and LLaMA3-70B, we have favoring evidence that current models struggle to maintain epistemic consistency in identifying the polarity and intensity of intermediates in causal reasoning. Additionally, we explore the potential of using internal token probabilities as an auxiliary tool to maintain causal epistemic consistency. In summary, our study bridges a critical gap in AI research by investigating the self-consistency over fine-grained intermediates involved in causal reasoning.

Downloads

Published

2025-04-11

How to Cite

Cui, S., Li, J., Mouchel, L., Feng, Y., & Faltings, B. (2025). Nuance Matters: Probing Epistemic Consistency in Causal Reasoning. Proceedings of the AAAI Conference on Artificial Intelligence, 39(22), 23715–23723. https://doi.org/10.1609/aaai.v39i22.34542

Issue

Section

AAAI Technical Track on Natural Language Processing I