Kim, H., Oh, S., & Lee, S. (2026). Mitigating Length Bias in RLHF Through a Causal Lens. Proceedings of the AAAI Conference on Artificial Intelligence, 40(21), 17517–17525. https://doi.org/10.1609/aaai.v40i21.38806