Tan, Jiawei, Hongxing Wang, Kang Dang, Jiaxin Li, and Zhilong Ou. “Modality-Aware Shot Relating and Comparing for Video Scene Detection”. Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 7 (April 11, 2025): 7193-7201. Accessed May 1, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/32773.