A Graph Fusion Approach for Cross-Lingual Machine Reading Comprehension

Authors

  • Zenan Xu, School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
  • Linjun Shou, Microsoft Search Technology Center Asia (STCA), Beijing, China
  • Jian Pei, School of Computing Science, Simon Fraser University
  • Ming Gong, Microsoft Search Technology Center Asia (STCA), Beijing, China
  • Qinliang Su, School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China; Guangdong Key Laboratory of Big Data Analysis and Processing, Guangzhou, China
  • Xiaojun Quan, School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
  • Daxin Jiang, Microsoft Search Technology Center Asia (STCA), Beijing, China

DOI:

https://doi.org/10.1609/aaai.v37i11.26623

Keywords:

SNLP: Machine Translation & Multilinguality

Abstract

Although great progress has been made for Machine Reading Comprehension (MRC) in English, scaling out to a large number of languages remains a huge challenge due to the lack of large amounts of annotated training data in non-English languages. To address this challenge, some recent efforts in cross-lingual MRC employ machine translation to transfer knowledge from English to other languages, through either explicit alignment or implicit attention. For effective knowledge transfer, it is beneficial to leverage both semantic and syntactic information. However, the existing methods fail to explicitly incorporate syntactic information in model learning. Consequently, the models are not robust to errors in alignment and noise in attention. In this work, we propose a novel approach that jointly models the cross-lingual alignment information and the monolingual syntactic information using a graph. We develop a series of algorithms, including graph construction, learning, and pre-training. Experiments on two benchmark datasets for cross-lingual MRC show that our approach outperforms all strong baselines, which verifies the effectiveness of syntactic information for cross-lingual MRC.
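
The abstract does not spell out how the graph is built, so the following is only an illustrative sketch of one way alignment and syntax information might share a single graph. It assumes tokens are nodes, dependency arcs are within-language edges, and word-alignment links are cross-language edges. The function name `build_fused_graph`, the edge labels, and the toy sentence pair are hypothetical and are not taken from the paper.

```python
def build_fused_graph(src_tokens, tgt_tokens, src_deps, tgt_deps, alignments):
    """Build one undirected, edge-labelled graph over two sentences.

    Nodes are ("src", i) and ("tgt", j) token positions.
    src_deps / tgt_deps: (head_index, dependent_index) pairs, assumed to
        come from some dependency parser (hypothetical input).
    alignments: (src_index, tgt_index) pairs, assumed to come from some
        word aligner (hypothetical input).
    """
    graph = {("src", i): set() for i in range(len(src_tokens))}
    graph.update({("tgt", j): set() for j in range(len(tgt_tokens))})

    def add_edge(u, v, kind):
        # Store each edge in both directions together with its type.
        graph[u].add((v, kind))
        graph[v].add((u, kind))

    for head, dep in src_deps:
        add_edge(("src", head), ("src", dep), "syntax")
    for head, dep in tgt_deps:
        add_edge(("tgt", head), ("tgt", dep), "syntax")
    for i, j in alignments:
        add_edge(("src", i), ("tgt", j), "alignment")
    return graph


# Toy English/German pair with hand-written arcs and alignment links.
src = ["She", "reads", "books"]
tgt = ["Sie", "liest", "Bücher"]
graph = build_fused_graph(
    src,
    tgt,
    src_deps=[(1, 0), (1, 2)],
    tgt_deps=[(1, 0), (1, 2)],
    alignments=[(0, 0), (1, 1), (2, 2)],
)
```

In this toy graph, the node for "reads" is connected to its two syntactic dependents and to its aligned counterpart "liest", so a model operating on the graph could in principle draw on both kinds of evidence at once. The learning and pre-training steps named in the abstract are not sketched here.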

Published

2023-06-26

How to Cite

Xu, Z., Shou, L., Pei, J., Gong, M., Su, Q., Quan, X., & Jiang, D. (2023). A Graph Fusion Approach for Cross-Lingual Machine Reading Comprehension. Proceedings of the AAAI Conference on Artificial Intelligence, 37(11), 13861-13868. https://doi.org/10.1609/aaai.v37i11.26623

Section

AAAI Technical Track on Speech & Natural Language Processing