Zhu, M., Liu, Y., Fu, Z., Wang, Q., & Zhang, Y. (2026). In-Token Rationality Optimization: Towards Accurate and Concise LLM Reasoning via Self-Feedback. Proceedings of the AAAI Conference on Artificial Intelligence, 40(41), 35195–35203. https://doi.org/10.1609/aaai.v40i41.40826