[1]
P. Bao, Y. Xia, W. Yang, B. P. Ng, M. H. Er, and A. C. Kot, “Local-Global Multi-Modal Distillation for Weakly-Supervised Temporal Video Grounding,” AAAI, vol. 38, no. 2, pp. 738–746, Mar. 2024.