Zhang, W. (2026). Reinforcement Learning Without Explicit Rewards: Theory and Practice. Proceedings of the AAAI Conference on Artificial Intelligence, 40(47), 39847–39847. https://doi.org/10.1609/aaai.v40i47.41364