Han, Z., Shang, M., Wang, X., Liu, Y.-S. and Zwicker, M. (2019) “Y2Seq2Seq: Cross-Modal Representation Learning for 3D Shape and Text by Joint Reconstruction and Prediction of View and Word Sequences”, Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), pp. 126-133. doi: 10.1609/aaai.v33i01.3301126.