1. Zhang W, Ying Y, Lu P, Zha H. Learning Long- and Short-Term User Literal-Preference with Multimodal Hierarchical Transformer Network for Personalized Image Caption. AAAI [Internet]. 2020 Apr 3 [cited 2024 Mar 2];34(05):9571-8. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/6503