Kong, R., Wu, C., & Zhang, Z. (2024). Generalizable Policy Improvement via Reinforcement Sampling (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 38(21), 23546–23547. https://doi.org/10.1609/aaai.v38i21.30466