Zhang, J., Xia, W., Dong, H., Lin, Q., & Cao, J. (2026). AP2O-Coder: Adaptively Progressive Preference Optimization for Reducing Compilation and Runtime Errors in LLM-Generated Code. Proceedings of the AAAI Conference on Artificial Intelligence, 40(41), 34701–34709. https://doi.org/10.1609/aaai.v40i41.40771