Wang, S., J. Zhang, and C. Zong. “Learning Multimodal Word Representation via Dynamic Fusion Methods”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32, no. 1, Apr. 2018, doi:10.1609/aaai.v32i1.12031.