A Domain-Transfer Meta Task Design Paradigm for Few-Shot Slot Tagging
Keywords:SNLP: Conversational AI/Dialogue Systems, ML: Meta Learning, SNLP: Text Mining
AbstractFew-shot slot tagging is an important task in dialogue systems and attracts much attention of researchers. Most previous few-shot slot tagging methods utilize meta-learning procedure for training and strive to construct a large number of different meta tasks to simulate the testing situation of insufficient data. However, there is a widespread phenomenon of overlap slot between two domains in slot tagging. Traditional meta tasks ignore this special phenomenon and cannot simulate such realistic few-shot slot tagging scenarios. It violates the basic principle of meta-learning which the meta task is consistent with the real testing task, leading to historical information forgetting problem. In this paper, we introduce a novel domain-transfer meta task design paradigm to tackle this problem. We distribute a basic domain to each target domain based on the coincidence degree of slot labels between these two domains. Unlike classic meta tasks which only rely on small samples of target domain, our meta tasks aim to correctly infer the class of target domain query samples based on both abundant data in basic domain and scarce data in target domain. To accomplish our meta task, we propose a Task Adaptation Network to effectively transfer the historical information from the basic domain to the target domain. We carry out sufficient experiments on the benchmark slot tagging dataset SNIPS and the name entity recognition dataset NER. Results demonstrate that our proposed model outperforms previous methods and achieves the state-of-the-art performance.
How to Cite
Yang, F., Zhou, X., Yang, Y., Ma, B., Dong, R., & Atawulla, A. (2023). A Domain-Transfer Meta Task Design Paradigm for Few-Shot Slot Tagging. Proceedings of the AAAI Conference on Artificial Intelligence, 37(11), 13887-13895. https://doi.org/10.1609/aaai.v37i11.26626
AAAI Technical Track on Speech & Natural Language Processing