Hu, J., Li, Z., Qi, B., Liu, G., & Wang, P. (2026). End-to-End Contrastive Language-Speech Pretraining Model for Long-Form Spoken Question Answering. Proceedings of the AAAI Conference on Artificial Intelligence, 40(37), 31041-31049. https://doi.org/10.1609/aaai.v40i37.40364