[1] T. Koike-Akino, X. Chen, J. Liu, Y. Wang, P. (Perry) Wang, and M. Brand, “LatentLLM: Activation-Aware Transform to Multi-Head Latent Attention”, AAAI, vol. 40, no. 27, pp. 22644–22652, Mar. 2026.