(1)
Parthiban, D. G. .; Mao, Y.; Inkpen, D. On the Softmax Bottleneck of Recurrent Language Models. AAAI 2021, 35, 13640-13647.