1.
Shao Z, Wang Y, Wang Q, Jiang T, Du Z, Ye H, et al. FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models. AAAI [Internet]. 2026 Mar. 14 [cited 2026 May 17];40(30):25278-85. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/39720