[1]

Zhang, Z. et al. 2026. Scalable Multi-Objective and Meta Reinforcement Learning via Gradient Estimation. Proceedings of the AAAI Conference on Artificial Intelligence. 40, 34 (Mar. 2026), 28609–28617. DOI:https://doi.org/10.1609/aaai.v40i34.40092.