Metaphors We Compute By: A Computational Audit of Cultural Translation vs. Thinking in LLMs
DOI:
https://doi.org/10.1609/aaaiss.v8i1.42599Abstract
Large language models (LLMs) are often described as multilingual because they can understand and respond in many languages. However, speaking a language is not the same as reasoning within a culture. This distinction motivates a critical question: do LLMs truly conduct culture-aware reasoning? This paper presents a preliminary computational audit of cultural inclusivity in a creative writing task. We empirically examine whether LLMs act as culturally diverse creative partners or merely as cultural translators that leverage a dominant conceptual framework with localized expressions. Using a metaphor generation task spanning five cultural settings and several abstract concepts as a case study, we find that the model exhibits stereotyped metaphor usage for certain settings, as well as Western defaultism. These findings suggest that merely prompting an LLM with a cultural identity does not guarantee culturally grounded reasoning.Downloads
Published
2026-05-18
How to Cite
Chang, Y., Qu, J., & Li, Z. (2026). Metaphors We Compute By: A Computational Audit of Cultural Translation vs. Thinking in LLMs. Proceedings of the AAAI Symposium Series, 8(1), 636–639. https://doi.org/10.1609/aaaiss.v8i1.42599
Issue
Section
Will AI Light Up Human Creativity or Replace It?