Zhang, T., Duan, H., Hao, H., Qiao, Y., Dai, J., & Hou, Z. (2026). Grounding Actions in Camera Space: Observation-Centric Vision-Language-Action Policy. Proceedings of the AAAI Conference on Artificial Intelligence, 40(22), 18782–18790. https://doi.org/10.1609/aaai.v40i22.38947