[1]
Y. Li, “EgoCross: Benchmarking Multimodal Large Language Models for Cross-Domain Egocentric Video Question Answering”, AAAI, vol. 40, no. 8, pp. 6592-6600, Mar. 2026.