Zhou, J., Xu, C., Tang, K., Ge, Y., Guo, T., & Cheng, L. (2026). VPHO: Joint Visual-Physical Cue Learning and Aggregation for Hand-Object Pose Estimation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(16), 13683–13691. https://doi.org/10.1609/aaai.v40i16.38375