1.
Lin Y, Zhou Z, Gao J, Guo X, Zhang J, Wu H, et al. GeWu: A Culturally-Grounded Chinese Benchmark for Multi-Stage Social Bias Evaluation in Large Language Models. AAAI [Internet]. 2026 Mar. 14 [cited 2026 May 13];40(38):32033-41. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/40474