[1]
C. Li and X. Wang, “DiTEA: Mixture-of-Experts for Vision-Language-Action Model in Robotic Manipulation”, AAAI, vol. 40, no. 22, pp. 18379–18387, Mar. 2026.