Zhou, J., Zhou, Y., Han, M., Wang, T., Chang, X., Cholakkal, H., & Anwer, R. M. (2026). Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(16), 13665–13673. https://doi.org/10.1609/aaai.v40i16.38373