[1]
M. Zhu, Y. Liu, Z. Fu, Q. Wang, and Y. Zhang, “In-Token Rationality Optimization: Towards Accurate and Concise LLM Reasoning via Self-Feedback”, AAAI, vol. 40, no. 41, pp. 35195–35203, Mar. 2026.