Wu, Houming, and Ling Chen. “TawPipe: Topology-Aware Weight Pipeline Parallelism for Accelerating Long-Context Large Models Training.” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 32, Mar. 2026, pp. 26894-26902, doi:10.1609/aaai.v40i32.39901.