Zhou, Y., Wu, Y., & Tan, C. (2026). PQDA: Policy-Aligned Q-Consistency Meets Decoupled Augmentation for Generalizable Visual RL. Proceedings of the AAAI Conference on Artificial Intelligence, 40(34), 29080–29088. https://doi.org/10.1609/aaai.v40i34.40145