[1]
Z. Ou, P. Liang, L. Qiao, J. Han, and B. Liu, “ParaDySe: A Parallel Strategy Switching Framework for Dynamic Sequences in Transformer-based Large Language Models”, AAAI, vol. 40, no. 29, pp. 24648–24655, Mar. 2026.