(1)

Sarkar, P.; Etemad, A. XKD: Cross-Modal Knowledge Distillation With Domain Alignment for Video Representation Learning. AAAI 2024, 38, 14875-14885.