TACIT: A Target-Agnostic Feature Disentanglement Framework for Cross-Domain Text Classification

Authors

  • Rui Song School of Artificial Intelligence, Jilin University
  • Fausto Giunchiglia Department of Information Engineering and Computer Science, University of Trento School of Artificial Intelligence, Jilin University College of Computer Science and Technology, Jilin University
  • Yingji Li College of Computer Science and Technology, Jilin University
  • Mingjie Tian School of Artificial Intelligence, Jilin University
  • Hao Xu School of Artificial Intelligence, Jilin University College of Computer Science and Technology, Jilin University Chongqing Research Institute, Jilin University

DOI:

https://doi.org/10.1609/aaai.v38i17.29866

Keywords:

NLP: Text Classification, NLP: Applications

Abstract

Cross-domain text classification aims to transfer models from label-rich source domains to label-poor target domains, giving it a wide range of practical applications. Many approaches promote cross-domain generalization by capturing domaininvariant features. However, these methods rely on unlabeled samples provided by the target domains, which renders the model ineffective when the target domain is agnostic. Furthermore, the models are easily disturbed by shortcut learning in the source domain, which also hinders the improvement of domain generalization ability. To solve the aforementioned issues, this paper proposes TACIT, a target domain agnostic feature disentanglement framework which adaptively decouples robust and unrobust features by Variational Auto-Encoders. Additionally, to encourage the separation of unrobust features from robust features, we design a feature distillation task that compels unrobust features to approximate the output of the teacher. The teacher model is trained with a few easy samples that are easy to carry potential unknown shortcuts. Experimental results verify that our framework achieves comparable results to state-of-the-art baselines while utilizing only source domain data.

Published

2024-03-24

How to Cite

Song, R., Giunchiglia, F., Li, Y., Tian, M., & Xu, H. (2024). TACIT: A Target-Agnostic Feature Disentanglement Framework for Cross-Domain Text Classification. Proceedings of the AAAI Conference on Artificial Intelligence, 38(17), 18999-19007. https://doi.org/10.1609/aaai.v38i17.29866

Issue

Section

AAAI Technical Track on Natural Language Processing II