Zhang, Teresa. “Algorithms for Context Engineering in LLM Inference: Optimization of Placement, Compression, and Scheduling”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 48, Mar. 2026, pp. 41537-9, doi:10.1609/aaai.v40i48.42332.