[1]
Zhou, Y. et al. 2026. PQDA: Policy-Aligned Q-Consistency Meets Decoupled Augmentation for Generalizable Visual RL. Proceedings of the AAAI Conference on Artificial Intelligence. 40, 34 (Mar. 2026), 29080–29088. DOI:https://doi.org/10.1609/aaai.v40i34.40145.