Xu, L., Gao, Y., Song, W., & Hao, A. (2024). Weakly Supervised Multimodal Affordance Grounding for Egocentric Images. Proceedings of the AAAI Conference on Artificial Intelligence, 38(6), 6324–6332. https://doi.org/10.1609/aaai.v38i6.28451