(1) Feng, A.; Li, I.; Jiang, Y.; Ying, R. Diffuser: Efficient Transformers With Multi-Hop Attention Diffusion for Long Sequences. AAAI 2023, 37, 12772-12780.