(1)
Zhao, M.; Hu, W.; Wang, J.; Lai, X.; Huang, T.; Min, Y.; Yan, R.; Zhu, X. Making Every Head Count: Sparse Attention Without the Speed-Performance Trade-off. AAAI 2026, 40, 34959-34967.