Hu, J., Z. Li, B. Qi, G. Liu, and P. Wang. “End-to-End Contrastive Language-Speech Pretraining Model for Long-Form Spoken Question Answering”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 37, Mar. 2026, pp. 31041-9, doi:10.1609/aaai.v40i37.40364.