(1) Liu, Y.; Peng, D.; Wei, W.; Fu, Y.; Xie, W.; Chen, D. Detection-Based Intermediate Supervision for Visual Question Answering. AAAI 2024, 38, 14061-14068.