[1]
W. Zhang, “Reinforcement Learning Without Explicit Rewards: Theory and Practice”, AAAI, vol. 40, no. 47, pp. 39847–39847, Mar. 2026.