(1) Huang, X.; Huang, Y.-L.; Wen, Z. SoLA: Leveraging Soft Activation Sparsity and Low-Rank Decomposition for Large Language Model Compression. AAAI 2025, 39, 17494-17502.