[1]
Y. Kang, T. Liu, H. Li, Y. Hao, and W. Ding, “Self-Supervised Audio-and-Text Pre-training with Extremely Low-Resource Parallel Data”, AAAI, vol. 36, no. 10, pp. 10875-10883, Jun. 2022.