[1]
Z. Shao, “FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models”, AAAI, vol. 40, no. 30, pp. 25278–25285, Mar. 2026.