SHAO, Zishan; WANG, Yixiao; WANG, Qinsi; JIANG, Ting; DU, Zhixu; YE, Hancheng; ZHUO, Danyang; CHEN, Yiran; LI, Hai "Helen". FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 40, n. 30, p. 25278–25285, 2026. DOI: 10.1609/aaai.v40i30.39720. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/39720. Accessed: 17 May 2026.