1. Du K, Gemp I, Wu Y, Wu Y. AlphaSnake: Policy Iteration on a Nondeterministic NP-Hard Markov Decision Process (Student Abstract). AAAI [Internet]. 2024 Jul. 15 [cited 2026 May 31];37(13):16204-5. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/26962