Staniszewski, K., Tworkowski, S., Jaszczur, S., Zhao, Y., Michalewski, H., Kuciński, Ł., & Miłoś, P. (2025). Structured Packing in LLM Training Improves Long Context Utilization. Proceedings of the AAAI Conference on Artificial Intelligence, 39(24), 25201–25209. https://doi.org/10.1609/aaai.v39i24.34706