Zhang, T. (2026). Algorithms for Context Engineering in LLM Inference: Optimization of Placement, Compression, and Scheduling. Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41537–41539. https://doi.org/10.1609/aaai.v40i48.42332