[1]
R. Tanaka, K. Nishida, K. Nishida, T. Hasegawa, I. Saito, and K. Saito, “SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images”, AAAI, vol. 37, no. 11, pp. 13636-13645, Jun. 2023.