On the Importance of Word Order Information in Cross-lingual Sequence Labeling
Keywords: Machine Translation & Multilinguality, Syntax -- Tagging, Chunking & Parsing
Abstract
Cross-lingual models trained on source-language tasks can transfer directly to target languages. However, since word order varies across languages, cross-lingual models that overfit to the word order of the source language can perform sub-optimally in target languages. In this paper, we hypothesize that reducing the word order information fitted into the models can improve adaptation performance in target languages. To verify this hypothesis, we introduce several methods that make models encode less word order information of the source language, and test them on cross-lingual word embeddings and a pre-trained multilingual model. Experimental results on three sequence labeling tasks (part-of-speech tagging, named entity recognition, and slot filling) show that reducing the word order information injected into the model achieves better zero-shot cross-lingual performance. Further analysis illustrates that fitting either excessive or insufficient word order information into the model results in inferior cross-lingual performance. Moreover, our proposed methods can also be applied to strong cross-lingual models to further improve their performance.
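The abstract does not spell out the methods for reducing word order information. As an illustrative sketch only (not the paper's actual implementation), one simple way to weaken a model's reliance on source-language word order is to randomly shuffle tokens within each training sentence while keeping the per-token labels aligned; the function name and the example data below are hypothetical.

```python
import random

def shuffle_tokens_with_labels(tokens, labels, rng=None):
    """Randomly permute a sentence's tokens together with their
    per-token labels, so a sequence-labeling model sees less
    consistent source-language word order during training."""
    rng = rng or random.Random()
    order = list(range(len(tokens)))
    rng.shuffle(order)
    return [tokens[i] for i in order], [labels[i] for i in order]

# Hypothetical POS-tagged sentence for illustration.
tokens = ["the", "cat", "sat", "down"]
labels = ["DET", "NOUN", "VERB", "ADV"]
shuf_toks, shuf_labs = shuffle_tokens_with_labels(
    tokens, labels, random.Random(0)
)
# Token-label pairs are preserved even though the order changes.
assert set(zip(shuf_toks, shuf_labs)) == set(zip(tokens, labels))
```

A related alternative for Transformer-based models is to remove or attenuate positional embeddings, which similarly limits how much word order the encoder can exploit.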
How to Cite
Liu, Z., Winata, G. I., Cahyawijaya, S., Madotto, A., Lin, Z., & Fung, P. (2021). On the Importance of Word Order Information in Cross-lingual Sequence Labeling. Proceedings of the AAAI Conference on Artificial Intelligence, 35(15), 13461-13469. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/17588
AAAI Technical Track on Speech and Natural Language Processing II