ZHANG, Teresa. Algorithms for Context Engineering in LLM Inference: Optimization of Placement, Compression, and Scheduling. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 40, n. 48, p. 41537–41539, 2026. DOI: 10.1609/aaai.v40i48.42332. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/42332. Acesso em: 17 may. 2026.