[1]

Y. Wang, “What-Meets-Where: Unified Learning of Action and Contact Localization in Images”, AAAI, vol. 40, no. 21, pp. 17832–17840, Mar. 2026.