[1]
Wang, Y. et al. 2026. APEX-Q: Arbitrary-dimension Product-EXtension Quantization for Accelerated LLM Deployment (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence. 40, 48 (Mar. 2026), 41424–41426. DOI:https://doi.org/10.1609/aaai.v40i48.42293.