[1]

D. Liu, “Unsupervised Temporal Video Grounding with Deep Semantic Clustering”, AAAI, vol. 36, no. 2, pp. 1683-1691, Jun. 2022.