[1]
H. Wu and L. Chen, “TawPipe: Topology-Aware Weight Pipeline Parallelism for Accelerating Long-Context Large Models Training”, AAAI, vol. 40, no. 32, pp. 26894–26902, Mar. 2026.