[1] X. Wang, “Efficient Post-Training Refinement of Latent Reasoning in Large Language Models”, AAAI, vol. 40, no. 40, pp. 33692–33700, Mar. 2026.