[1] L. Xue, X. Li, and N. L. Zhang, “Not All Attention Is Needed: Gated Attention Network for Sequence Data,” AAAI, vol. 34, no. 04, pp. 6550–6557, Apr. 2020.