[1]
J. Zhou, “Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation”, AAAI, vol. 40, no. 16, pp. 13665–13673, Mar. 2026.