Zhang, R., Dong, M., Zhang, Y., Heng, L., Chi, X., Dai, G., … Zhang, S. (2026). MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(22), 18764–18772. https://doi.org/10.1609/aaai.v40i22.38945