[1]

Y. Zhou, Y. Wu, and C. Tan, “PQDA:Policy-Aligned Q-Consistency Meets Decoupled Augmentation for Generalizable Visual RL”, AAAI, vol. 40, no. 34, pp. 29080–29088, Mar. 2026.