Bai, H., P. Shi, J. Lin, Y. Xie, L. Tan, K. Xiong, W. Gao, and M. Li. “Segatron: Segment-Aware Transformer for Language Modeling and Understanding”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 14, May 2021, pp. 12526-34, https://ojs.aaai.org/index.php/AAAI/article/view/17485.