Wang, Hao, Cheng Deng, Fan Ma, and Yi Yang. “Context Modulated Dynamic Networks for Actor and Action Video Segmentation With Language Queries”. Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 07 (April 3, 2020): 12152-12159. Accessed July 21, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/6895.