Shen, Sheng, Zhen Dong, Jiayu Ye, Linjian Ma, Zhewei Yao, Amir Gholami, Michael W. Mahoney, and Kurt Keutzer. “Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT”. Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 05 (April 3, 2020): 8815-8821. Accessed May 15, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/6409.