(1)
Zhou, Y.; Wu, Y.; Tan, C. PQDA:Policy-Aligned Q-Consistency Meets Decoupled Augmentation for Generalizable Visual RL. AAAI 2026, 40, 29080-29088.