[1]
Y. Li, Y. Lin, T. Xiao, and J. Zhu, “An Efficient Transformer Decoder with Compressed Sub-layers”, AAAI, vol. 35, no. 15, pp. 13315-13323, May 2021.