Zhang, Z. (2026) “Scalable Multi-Objective and Meta Reinforcement Learning via Gradient Estimation”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(34), pp. 28609–28617. doi: 10.1609/aaai.v40i34.40092.