Stranisci, M. A., & Hardmeier, C. (2026). What Are They Filtering Out? An Experimental Benchmark of Filtering Strategies for Harm Reduction in Pretraining Datasets. Proceedings of the AAAI Conference on Artificial Intelligence, 40(46), 39303–39313. https://doi.org/10.1609/aaai.v40i46.41279