(1) Li, Y.; Lin, Y.; Xiao, T.; Zhu, J. An Efficient Transformer Decoder With Compressed Sub-Layers. AAAI 2021, 35, 13315-13323.