[1]
D. G. . Parthiban, Y. Mao, and D. Inkpen, “On the Softmax Bottleneck of Recurrent Language Models”, AAAI, vol. 35, no. 15, pp. 13640-13647, May 2021.