[1] M. Zhussip, D. Shopkhoev, A. Ali, and S. Lefkimmiatis, “Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning”, AAAI, vol. 40, no. 34, pp. 29260–29268, Mar. 2026.