(1)
Staniszewski, K.; Tworkowski, S.; Jaszczur, S.; Zhao, Y.; Michalewski, H.; Kuciński, Łukasz; Miłoś, P. Structured Packing in LLM Training Improves Long Context Utilization. AAAI 2025, 39, 25201-25209.