Koike-Akino, Toshiaki, et al. “LatentLLM: Activation-Aware Transform to Multi-Head Latent Attention.” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 27, Mar. 2026, pp. 22644-52, doi:10.1609/aaai.v40i27.39425.