[1]
M. Holla and I. Lourentzou, “Commonsense for Zero-Shot Natural Language Video Localization”, AAAI, vol. 38, no. 3, pp. 2166–2174, Mar. 2024.