Shao, Zishan, et al. “FlashSVD: Memory-Efficient Inference With Streaming for Low-Rank Models”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 30, Mar. 2026, pp. 25278-85, doi:10.1609/aaai.v40i30.39720.