Li, Yanyang, Ye Lin, Tong Xiao, and Jingbo Zhu. 2021. “An Efficient Transformer Decoder With Compressed Sub-Layers”. Proceedings of the AAAI Conference on Artificial Intelligence 35 (15):13315-23. https://doi.org/10.1609/aaai.v35i15.17572.