Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study

Authors

  • Calvin Yixiang Cheng Oxford Internet Institute, University of Oxford
  • Scott A. Hale Oxford Internet Institute, University of Oxford Meedan

DOI:

https://doi.org/10.1609/icwsm.v20i1.42650

Abstract

This study explores computational approaches for measuring moral foundations (MFs) in non-English corpora. Since most resources are developed primarily for English, cross- linguistic applications of moral foundation theory remain limited. Using Chinese as a case study, this paper evaluates the effectiveness of applying English resources to machine translated text, local language lexicons, multilingual encoder-only language models, and decoder-only large language models (LLMs) in measuring MFs in non-English texts. The results indicate that machine translation and local lexicon approaches are insufficient for complex moral assessments, frequently resulting in a substantial loss of cultural information. In contrast, language models demonstrate reliable cross-language performance with transfer learning, with LLMs excelling in terms of data efficiency. Importantly, this study also underscores the need for human-in-the-loop validation of automated MF assessment, as even the most advanced models may overlook cultural nuances and face potential risks in cultural misalignment. The findings highlight the potential of LLMs for cross-language MF measurements and other complex multilingual deductive coding tasks.

Downloads

Published

2026-05-25

How to Cite

Cheng, C. Y., & Hale, S. A. (2026). Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study. Proceedings of the International AAAI Conference on Web and Social Media, 20(1), 487–503. https://doi.org/10.1609/icwsm.v20i1.42650