Zhang, J., Cai, K., Yang, J., Wang, J., Tang, C. and Wang, K. (2026) “Top-Down Semantic Refinement for Image Captioning”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(15), pp. 12591-12599. doi: 10.1609/aaai.v40i15.38254.