Sarkar, P., & Etemad, A. (2024). XKD: Cross-Modal Knowledge Distillation with Domain Alignment for Video Representation Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 38(13), 14875-14885. https://doi.org/10.1609/aaai.v38i13.29407