Huang, Hen-Hsen. “Democratizing LLM Efficiency: From Hyperscale Optimizations to Universal Deployability”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 46, Mar. 2026, pp. 39707-14, doi:10.1609/aaai.v40i46.41324.