[1]

H. Dubey and C. Pack, “Leveraging Textual Memory and Key Frame Reasoning for Full Video Understanding Using Off-the-Shelf LLMs and VLMs (Student Abstract)”, AAAI, vol. 39, no. 28, pp. 29351-29352, Apr. 2025.