MGT-Prism: Enhancing Domain Generalization for Machine-Generated Text Detection via Spectral Alignment

Authors

  • Shengchao Liu Xi'an Jiaotong University
  • Xiaoming Liu Xi'an Jiaotong University
  • Chengzhengxu Li Xi'an Jiaotong University
  • Zhaohan Zhang Queen Mary University of London
  • Guoxin Ma Xi'an Jiaotong University
  • Yu Lan Xi'an Jiaotong University
  • Shuai Xiao Alibaba

DOI:

https://doi.org/10.1609/aaai.v40i38.40485

Abstract

Large Language Models have shown growing ability to generate fluent and coherent texts that are highly similar to the writing style of humans. Current detectors for Machine-Generated Text (MGT) perform well when they are trained and tested in the same domain but generalize poorly to unseen domains, due to domain shift between data from different sources. In this work, we propose MGT-Prism , an MGT detection method from the perspective of the frequency domain for better domain generalization. Our key insight stems from analyzing text representations in the frequency domain, where we observe consistent spectral patterns across diverse domains, while significant discrepancies in magnitude emerge between MGT and human-written texts (HWTs). The observation initiates the design of a low frequency domain filtering module for filtering out the document-level features that are sensitive to domain shift, and a dynamic spectrum alignment strategy to extract the task-specific and domain-invariant features for improving the detector's performance in domain generalization. Extensive experiments demonstrate that MGT-Prism outperforms state‑of‑the‑art baselines by an average of 0.90% in accuracy and 0.92% in F1 score on 11 test datasets across three domain‑generalization scenarios.

Published

2026-03-14

How to Cite

Liu, S., Liu, X., Li, C., Zhang, Z., Ma, G., Lan, Y., & Xiao, S. (2026). MGT-Prism: Enhancing Domain Generalization for Machine-Generated Text Detection via Spectral Alignment. Proceedings of the AAAI Conference on Artificial Intelligence, 40(38), 32132–32140. https://doi.org/10.1609/aaai.v40i38.40485

Issue

Section

AAAI Technical Track on Natural Language Processing III