Li, C., & Wang, X. (2026). DiTEA: Mixture-of-Experts for Vision-Language-Action Model in Robotic Manipulation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(22), 18379–18387. https://doi.org/10.1609/aaai.v40i22.38902