[1] Ruan, L. et al. 2023. Accommodating Audio Modality in CLIP for Multimodal Processing. Proceedings of the AAAI Conference on Artificial Intelligence. 37, 8 (Jun. 2023), 9641–9649. DOI: https://doi.org/10.1609/aaai.v37i8.26153.