Li, L., Li, Q., Zhang, B., & Chu, X. (2024). Norm Tweaking: High-Performance Low-Bit Quantization of Large Language Models. Proceedings of the AAAI Conference on Artificial Intelligence, 38(17), 18536-18544. https://doi.org/10.1609/aaai.v38i17.29815