(1) Zhang, W.; Ying, Y.; Lu, P.; Zha, H. Learning Long- and Short-Term User Literal-Preference With Multimodal Hierarchical Transformer Network for Personalized Image Caption. AAAI 2020, 34, 9571-9578.