(1) Zhang, Z.; Duan, M.; Ye, Y.; Zhang, H. R. Scalable Multi-Objective and Meta Reinforcement Learning via Gradient Estimation. AAAI 2026, 40, 28609-28617.