Shao, Z., Wang, Y., Wang, Q., Jiang, T., Du, Z., Ye, H., … ¨Helen¨ Li, H. (2026). FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models. Proceedings of the AAAI Conference on Artificial Intelligence, 40(30), 25278–25285. https://doi.org/10.1609/aaai.v40i30.39720