Dai, M., Li, J., Zhuang, J., Zhang, X., & Yang, W. (2025). Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints. Proceedings of the AAAI Conference on Artificial Intelligence, 39(3), 2618–2626. https://doi.org/10.1609/aaai.v39i3.32265