(1)
Stranisci, M. A.; Hardmeier, C. What Are They Filtering Out? An Experimental Benchmark of Filtering Strategies for Harm Reduction in Pretraining Datasets. AAAI 2026, 40, 39303-39313.