Huang, Chengyu, Zhengxin Zhang, and Claire Cardie. 2026. “HAPO: Training Language Models to Reason Concisely via History-Aware Policy Optimization.” Proceedings of the AAAI Conference on Artificial Intelligence 40 (37): 31122–30. https://doi.org/10.1609/aaai.v40i37.40373.