Synergistic Anchored Contrastive Pre-training for Few-Shot Relation Extraction

Authors

  • Da Luo, University of Electronic Science and Technology of China
  • Yanglei Gan, University of Electronic Science and Technology of China
  • Rui Hou, University of Electronic Science and Technology of China
  • Run Lin, University of Electronic Science and Technology of China
  • Qiao Liu, University of Electronic Science and Technology of China
  • Yuxiang Cai, University of Electronic Science and Technology of China
  • Wannian Gao, University of Electronic Science and Technology of China

DOI:

https://doi.org/10.1609/aaai.v38i17.29838

Keywords:

NLP: Information Extraction, NLP: Text Classification

Abstract

Few-shot Relation Extraction (FSRE) aims to extract relational facts from a sparse set of labeled corpora. Recent studies have achieved promising results on FSRE by employing Pre-trained Language Models (PLMs) within a supervised contrastive learning framework that considers both instances and label facts. However, how to effectively harness massive instance-label pairs so that the learned representations are semantically rich has not been fully explored in this paradigm. To address this gap, we introduce a novel synergistic anchored contrastive pre-training framework, motivated by the insight that the diverse viewpoints conveyed through instance-label pairs capture incomplete yet complementary intrinsic textual semantics. Specifically, the framework employs a symmetrical contrastive objective that combines a sentence-anchored and a label-anchored contrastive loss. Together, these two losses let the model establish a robust and uniform representation space that captures the reciprocal alignment of feature distributions between instances and relational facts, while maximizing the mutual information across diverse perspectives of the same relation. Experimental results demonstrate that our framework achieves significant performance gains over baseline models on downstream FSRE tasks, and that it adapts well to domain shift and zero-shot relation extraction. Our code is available online at https://github.com/AONE-NLP/FSRE-SaCon.
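To make the symmetrical objective concrete, below is a minimal PyTorch sketch of a symmetric anchored contrastive loss over paired instance and label representations, in the spirit of the sentence-anchored and label-anchored losses the abstract describes. The function name, argument names, and temperature value are illustrative assumptions, not the paper's actual implementation (see the linked repository for that).

  # Minimal sketch: symmetric sentence-/label-anchored contrastive loss.
  # All names and the temperature default are illustrative assumptions.
  import torch
  import torch.nn.functional as F

  def symmetric_contrastive_loss(sent_emb, label_emb, temperature=0.05):
      """sent_emb, label_emb: (batch, dim) paired instance/label encodings."""
      sent_emb = F.normalize(sent_emb, dim=-1)
      label_emb = F.normalize(label_emb, dim=-1)
      # Pairwise cosine similarities, scaled by temperature: (batch, batch).
      logits = sent_emb @ label_emb.t() / temperature
      targets = torch.arange(sent_emb.size(0), device=sent_emb.device)
      # Sentence-anchored: each instance should match its own relation label.
      loss_sent = F.cross_entropy(logits, targets)
      # Label-anchored: each label should match its own instance (transpose).
      loss_label = F.cross_entropy(logits.t(), targets)
      return (loss_sent + loss_label) / 2

Averaging the two directions is what makes the objective symmetric: instances and relation labels each serve as anchors, so the representation space is pulled into mutual alignment from both viewpoints rather than from instances alone.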

Published

2024-03-24

How to Cite

Luo, D., Gan, Y., Hou, R., Lin, R., Liu, Q., Cai, Y., & Gao, W. (2024). Synergistic Anchored Contrastive Pre-training for Few-Shot Relation Extraction. Proceedings of the AAAI Conference on Artificial Intelligence, 38(17), 18742-18750. https://doi.org/10.1609/aaai.v38i17.29838

Section

AAAI Technical Track on Natural Language Processing II