[1]
M. M. Islam, A. Gladstone, and T. Iqbal, “PATRON: Perspective-Aware Multitask Model for Referring Expression Grounding Using Embodied Multimodal Cues”, AAAI, vol. 37, no. 1, pp. 971-979, Jun. 2023.