BeyondGender: A Multifaceted Bilingual Dataset for Practical Sexism Detection

Authors

  • Xuan Luo Harbin Institute of Technology, Shenzhen, China Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China
  • Li Yang Harbin Institute of Technology, Shenzhen, China
  • Han Zhang Harbin Institute of Technology, Shenzhen, China Peng Cheng Laboratory, Shenzhen, China
  • Geng Tu Harbin Institute of Technology, Shenzhen, China
  • Qianlong Wang Harbin Institute of Technology, Shenzhen, China
  • Keyang Ding Harbin Institute of Technology, Shenzhen, China
  • Chuang Fan Harbin Institute of Technology, Shenzhen, China
  • Jing Li Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China Research Centre on Data Science & Artificial Intelligence, Hong Kong, China
  • Ruifeng Xu Harbin Institute of Technology, Shenzhen, China Peng Cheng Laboratory, Shenzhen, China Guangdong Provincial Key Laboratory of Novel Security Intelligence Technologies, China

DOI:

https://doi.org/10.1609/aaai.v39i23.34656

Abstract

Sexism affects both women and men, yet research often overlooks misandry and suffers from overly broad annotations that limit AI applications. To address this, we introduce BeyondGender, a dataset meticulously annotated according to the latest definitions of misogyny and misandry. It features innovative multifaceted labels encompassing aspects of sexism, gender, phrasing, misogyny, and misandry. The dataset includes 6K English and 1.7K Chinese sexism instances, alongside 13K non-sexism examples. Our evaluations of masked language models and large language models reveal that they detect misogyny in English and misandry in Chinese more effectively, with F1-scores of 0.87 and 0.62, respectively. However, they frequently misclassify hostile and mild comments, underscoring the complexity of sexism detection. Parallel corpus experiments suggest promising data augmentation strategies to enhance AI systems for nuanced sexism detection, and our dataset can be leveraged to improve value alignment in large language models.

Published

2025-04-11

How to Cite

Luo, X., Yang, L., Zhang, H., Tu, G., Wang, Q., Ding, K., … Xu, R. (2025). BeyondGender: A Multifaceted Bilingual Dataset for Practical Sexism Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 39(23), 24750–24758. https://doi.org/10.1609/aaai.v39i23.34656

Issue

Section

AAAI Technical Track on Natural Language Processing II