[1]
T. Zhang, “Algorithms for Context Engineering in LLM Inference: Optimization of Placement, Compression, and Scheduling”, AAAI, vol. 40, no. 48, pp. 41537–41539, Mar. 2026.