Qiu, Ruiyu, Rui Wang, Guanghui Yang, Xiang Li, and Zhijiang Shao. 2026. “LPPG-RL: Lexicographically Projected Policy Gradient Reinforcement Learning With Subproblem Exploration”. Proceedings of the AAAI Conference on Artificial Intelligence 40 (30):25009-17. https://doi.org/10.1609/aaai.v40i30.39689.