He, Z. (2025) “Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies”, Proceedings of the AAAI Conference on Artificial Intelligence, 39(16), pp. 17159–17167. doi: 10.1609/aaai.v39i16.33886.