Lin, Y. (2026) “GeWu: A Culturally-Grounded Chinese Benchmark for Multi-Stage Social Bias Evaluation in Large Language Models”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(38), pp. 32033–32041. doi: 10.1609/aaai.v40i38.40474.