[1]
Hu, J., Li, Z., Qi, B., Liu, G. and Wang, P. 2026. End-to-End Contrastive Language-Speech Pretraining Model for Long-Form Spoken Question Answering. Proceedings of the AAAI Conference on Artificial Intelligence. 40, 37 (Mar. 2026), 31041-31049. DOI:https://doi.org/10.1609/aaai.v40i37.40364.