1.
Huang X, Huang Y-L, Wen Z. SoLA: Leveraging Soft Activation Sparsity and Low-Rank Decomposition for Large Language Model Compression. AAAI [Internet]. 2025 Apr. 11 [cited 2026 May 13];39(16):17494-502. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/33923