(1) Li, B.; Yang, F.; Mao, Y.; Ye, Q.; Chen, H.; Zhong, Y. Tri-Ergon: Fine-Grained Video-to-Audio Generation With Multi-Modal Conditions and LUFS Control. AAAI 2025, 39, 4616-4624.