(1)

Zhang, W. Reinforcement Learning Without Explicit Rewards: Theory and Practice. AAAI 2026, 40, 39847-39847.