Wang, H., Deng, C., Ma, F., & Yang, Y. (2020). Context Modulated Dynamic Networks for Actor and Action Video Segmentation with Language Queries. Proceedings of the AAAI Conference on Artificial Intelligence, 34(07), 12152-12159. https://doi.org/10.1609/aaai.v34i07.6895