[1]
H. Kim, S. Oh, and S. Lee, “Mitigating Length Bias in RLHF Through a Causal Lens”, AAAI, vol. 40, no. 21, pp. 17517–17525, Mar. 2026.