(1) Bai, H.; Shi, P.; Lin, J.; Xie, Y.; Tan, L.; Xiong, K.; Gao, W.; Li, M. Segatron: Segment-Aware Transformer for Language Modeling and Understanding. AAAI 2021, 35, 12526-12534.