SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems

Harrison Lee; Raghav Gupta; Abhinav Rastogi; Yuan Cao; Bin Zhang; Yonghui Wu

doi:10.1609/aaai.v36i10.21341

SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems

Authors

Harrison Lee Google Research
Raghav Gupta Google Research
Abhinav Rastogi Google Research
Yuan Cao Google Research
Bin Zhang Google Research
Yonghui Wu Google Research

DOI:

https://doi.org/10.1609/aaai.v36i10.21341

Keywords:

Speech & Natural Language Processing (SNLP)

Abstract

Zero/few-shot transfer to unseen services is a critical challenge in task-oriented dialogue research. The Schema-Guided Dialogue (SGD) dataset introduced a paradigm for enabling models to support any service in zero-shot through schemas, which describe service APIs to models in natural language. We explore the robustness of dialogue systems to linguistic variations in schemas by designing SGD-X - a benchmark extending SGD with semantically similar yet stylistically diverse variants for every schema. We observe that two top state tracking models fail to generalize well across schema variants, measured by joint goal accuracy and a novel metric for measuring schema sensitivity. Additionally, we present a simple model-agnostic data augmentation method to improve schema robustness.

Downloads

Published

2022-06-28

How to Cite

Lee, H., Gupta, R., Rastogi, A., Cao, Y., Zhang, B., & Wu, Y. (2022). SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems. Proceedings of the AAAI Conference on Artificial Intelligence, 36(10), 10938–10946. https://doi.org/10.1609/aaai.v36i10.21341

Download Citation

Issue

Vol. 36 No. 10: AAAI-22 Technical Tracks 10

Section

AAAI Technical Track on Speech and Natural Language Processing

SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information