C2L: Causally Contrastive Learning for Robust Text Classification

Seungtaek Choi; Myeongho Jeong; Hojae Han; Seung-won Hwang

doi:10.1609/aaai.v36i10.21296

Authors

Seungtaek Choi Yonsei University
Myeongho Jeong Yonsei University
Hojae Han Seoul National University
Seung-won Hwang Seoul National University

DOI:

https://doi.org/10.1609/aaai.v36i10.21296

Keywords:

Speech & Natural Language Processing (SNLP)

Abstract

Despite the super-human accuracy of recent deep models in NLP tasks, their robustness is reportedly limited due to their reliance on spurious patterns. We thus aim to leverage contrastive learning and counterfactual augmentation for robustness. For augmentation, existing work either requires humans to add counterfactuals to the dataset or machines to automatically matches near-counterfactuals already in the dataset. Unlike existing augmentation is affected by spurious correlations, ours, by synthesizing “a set” of counterfactuals, and making a collective decision on the distribution of predictions on this set, can robustly supervise the causality of each term. Our empirical results show that our approach, by collective decisions, is less sensitive to task model bias of attribution-based synthesis, and thus achieves significant improvements, in diverse dimensions: 1) counterfactual robustness, 2) cross-domain generalization, and 3) generalization from scarce data.

C2L: Causally Contrastive Learning for Robust Text Classification

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information