Li, J., Zhang, Y., Hu, J.-F., Tan, C., Liang, T., & Xia, B. (2026). TubeRMC: Tube-conditioned Reconstruction with Mutual Constraints for Weakly-supervised Spatio-Temporal Video Grounding. Proceedings of the AAAI Conference on Artificial Intelligence, 40(8), 6253–6261. https://doi.org/10.1609/aaai.v40i8.37551