Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study

Calvin Yixiang Cheng; Scott A. Hale

doi:10.1609/icwsm.v20i1.42650

Authors

Calvin Yixiang Cheng Oxford Internet Institute, University of Oxford
Scott A. Hale Oxford Internet Institute, University of Oxford Meedan

DOI:

https://doi.org/10.1609/icwsm.v20i1.42650

Abstract

This study explores computational approaches for measuring moral foundations (MFs) in non-English corpora. Since most resources are developed primarily for English, cross- linguistic applications of moral foundation theory remain limited. Using Chinese as a case study, this paper evaluates the effectiveness of applying English resources to machine translated text, local language lexicons, multilingual encoder-only language models, and decoder-only large language models (LLMs) in measuring MFs in non-English texts. The results indicate that machine translation and local lexicon approaches are insufficient for complex moral assessments, frequently resulting in a substantial loss of cultural information. In contrast, language models demonstrate reliable cross-language performance with transfer learning, with LLMs excelling in terms of data efficiency. Importantly, this study also underscores the need for human-in-the-loop validation of automated MF assessment, as even the most advanced models may overlook cultural nuances and face potential risks in cultural misalignment. The findings highlight the potential of LLMs for cross-language MF measurements and other complex multilingual deductive coding tasks.

Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information