[1]
J. Zhou, C. Xu, K. Tang, Y. Ge, T. Guo, and L. Cheng, “VPHO: Joint Visual-Physical Cue Learning and Aggregation for Hand-Object Pose Estimation”, AAAI, vol. 40, no. 16, pp. 13683–13691, Mar. 2026.