Multi-Mask Label Mapping for Prompt-Based Learning

Jirui Qi; Richong Zhang; Jaein Kim; Junfan Chen; Wenyi Qin; Yongyi Mao

doi:10.1609/aaai.v37i11.26579

Authors

Jirui Qi Beihang University
Richong Zhang Beihang University
Jaein Kim Beihang University
Junfan Chen Beihang University
Wenyi Qin Beihang University
Yongyi Mao University of Ottawa

DOI:

https://doi.org/10.1609/aaai.v37i11.26579

Keywords:

SNLP: Text Classification, SNLP: Applications, SNLP: Language Models

Abstract

Prompt-based Learning has shown significant success in few-shot classification. The mainstream approach is to concatenate a template for the input text to transform the classification task into a cloze-type task where label mapping plays an important role in finding the ground-truth labels. While current label mapping methods only use the contexts in one single input, it could be crucial if wrong information is contained in the text. Specifically, it is proved in recent work that even the large language models like BERT/RoBERTa make classification decisions heavily dependent on a specific keyword regardless of the task or the context. Such a word is referred to as a lexical cue and if a misleading lexical cue is included in the instance it will lead the model to make a wrong prediction. We propose a multi-mask prompt-based approach with Multi-Mask Label Mapping (MMLM) to reduce the impact of misleading lexical cues by allowing the model to exploit multiple lexical cues. To satisfy the conditions of few-shot learning, an instance augmentation approach for the cloze-type model is proposed and the misleading cues are gradually excluded through training. We demonstrate the effectiveness of MMLM by both theoretical analysis and empirical studies, and show that MMLM outperforms other existing label mapping approaches.

Multi-Mask Label Mapping for Prompt-Based Learning

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Subscription