WANG, Zijun; TU, Haoqin; WANG, Yuhan; WU, Juncheng; LIU, Yanqing; MEI, Jieru; BARTOLDSON, Brian R.; KAILKHURA, Bhavya; XIE, Cihang. STAR-1: Safer Alignment of Reasoning LLMs with 1K Data. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 40, n. 44, p. 37988–37997, 2026. DOI: 10.1609/aaai.v40i44.41136. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/41136. Accessed: 25 May 2026.