Koike-Akino, T., Chen, X., Liu, J., Wang, Y., Wang, P. (Perry), & Brand, M. (2026). LatentLLM: Activation-Aware Transform to Multi-Head Latent Attention. Proceedings of the AAAI Conference on Artificial Intelligence, 40(27), 22644–22652. https://doi.org/10.1609/aaai.v40i27.39425