Qiu, R. (2026) “LPPG-RL: Lexicographically Projected Policy Gradient Reinforcement Learning with Subproblem Exploration”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(30), pp. 25009–25017. doi: 10.1609/aaai.v40i30.39689.