Dou, Z.-Y., Tu, Z., Wang, X., Wang, L., Shi, S., & Zhang, T. (2019). Dynamic Layer Aggregation for Neural Machine Translation with Routing-by-Agreement. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 86-93. https://doi.org/10.1609/aaai.v33i01.330186