A Domain-Transfer Meta Task Design Paradigm for Few-Shot Slot Tagging

Fengyi Yang; Xi Zhou; Yating Yang; Bo Ma; Rui Dong; Abibulla Atawulla

doi:10.1609/aaai.v37i11.26626

Authors

Fengyi Yang Xinjiang Technical Institute of Physics & Chemistry, Chinese Academy of Sciences, Urumqi 830011, China University of Chinese Academy of Sciences, Beijing 100049, China Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi 830011, China
Xi Zhou Xinjiang Technical Institute of Physics & Chemistry, Chinese Academy of Sciences, Urumqi 830011, China University of Chinese Academy of Sciences, Beijing 100049, China Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi 830011, China
Yating Yang Xinjiang Technical Institute of Physics & Chemistry, Chinese Academy of Sciences, Urumqi 830011, China University of Chinese Academy of Sciences, Beijing 100049, China Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi 830011, China
Bo Ma Xinjiang Technical Institute of Physics & Chemistry, Chinese Academy of Sciences, Urumqi 830011, China University of Chinese Academy of Sciences, Beijing 100049, China Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi 830011, China
Rui Dong Xinjiang Technical Institute of Physics & Chemistry, Chinese Academy of Sciences, Urumqi 830011, China University of Chinese Academy of Sciences, Beijing 100049, China Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi 830011, China
Abibulla Atawulla Xinjiang Technical Institute of Physics & Chemistry, Chinese Academy of Sciences, Urumqi 830011, China University of Chinese Academy of Sciences, Beijing 100049, China Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi 830011, China

DOI:

https://doi.org/10.1609/aaai.v37i11.26626

Keywords:

SNLP: Conversational AI/Dialogue Systems, ML: Meta Learning, SNLP: Text Mining

Abstract

Few-shot slot tagging is an important task in dialogue systems and attracts much attention of researchers. Most previous few-shot slot tagging methods utilize meta-learning procedure for training and strive to construct a large number of different meta tasks to simulate the testing situation of insufficient data. However, there is a widespread phenomenon of overlap slot between two domains in slot tagging. Traditional meta tasks ignore this special phenomenon and cannot simulate such realistic few-shot slot tagging scenarios. It violates the basic principle of meta-learning which the meta task is consistent with the real testing task, leading to historical information forgetting problem. In this paper, we introduce a novel domain-transfer meta task design paradigm to tackle this problem. We distribute a basic domain to each target domain based on the coincidence degree of slot labels between these two domains. Unlike classic meta tasks which only rely on small samples of target domain, our meta tasks aim to correctly infer the class of target domain query samples based on both abundant data in basic domain and scarce data in target domain. To accomplish our meta task, we propose a Task Adaptation Network to effectively transfer the historical information from the basic domain to the target domain. We carry out sufficient experiments on the benchmark slot tagging dataset SNIPS and the name entity recognition dataset NER. Results demonstrate that our proposed model outperforms previous methods and achieves the state-of-the-art performance.

A Domain-Transfer Meta Task Design Paradigm for Few-Shot Slot Tagging

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Subscription