(1)
Chen, W.; Niu, J.; Liu, X.; Wang, Z.; Tang, S.; Zhu, G. DiffDVC: Accurate Event Detection for Dense Video Captioning via Diffusion Models. AAAI 2025, 39, 2221-2229.