1.
Qiu L, Ning S, He X. Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training. AAAI [Internet]. 2024Mar.24 [cited 2024Sep.12];38(5):4605-13. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/28260