Li, Liang, Qingyuan Li, Bo Zhang, and Xiangxiang Chu. 2024. “Norm Tweaking: High-Performance Low-Bit Quantization of Large Language Models.” Proceedings of the AAAI Conference on Artificial Intelligence 38 (17): 18536–44. https://doi.org/10.1609/aaai.v38i17.29815.