Shen, S., Z. Dong, J. Ye, L. Ma, Z. Yao, A. Gholami, M. W. Mahoney, and K. Keutzer. “Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 05, Apr. 2020, pp. 8815-21, doi:10.1609/aaai.v34i05.6409.