1.
Feng A, Li I, Jiang Y, Ying R. Diffuser: Efficient Transformers with Multi-Hop Attention Diffusion for Long Sequences. AAAI [Internet]. 2023Jun.26 [cited 2024Sep.12];37(11):12772-80. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/26502