Building Resilient Educational AI Through Multimodal Context Fusion

Authors

  • Donggil Song Texas A&M University
  • Anne Lippert Prairie View A&M University

DOI:

https://doi.org/10.1609/aaaiss.v9i1.42919

Abstract

Educational systems face escalating challenges: diverse learner needs, resource limits, and rapid technological change. Resilient AI is needed to remain useful as learner behavior, interface conditions, and infrastructure vary. We present a position on resilient educational AI through multimodal context fusion, demonstrated with a virtual reality (VR) algebra learning system. We define resilience operationally through three measurable indicators: (1) action-grounding accuracy, (2) tolerance to latency variation and fallback conditions, and (3) stability under changing learner behaviors across mission phases. In a controlled ablation using 30 scripted traces, the context-grounded system achieved 92% action reference accuracy and reduced hallucination events by 86% relative to a mission-only baseline. Complementary pilot logs from live VR use show sustained learner-initiated interaction and stable response timing, supporting the human-AI collaboration claim while remaining bounded by the absence of direct trust/usability surveys. We position multimodal context fusion as a practical design pattern for robust, adaptable, and scalable educational AI, with implications for broader resilience domains.

Downloads

Published

2026-06-23

How to Cite

Song, D., & Lippert, A. (2026). Building Resilient Educational AI Through Multimodal Context Fusion. Proceedings of the AAAI Symposium Series, 9(1), 157–161. https://doi.org/10.1609/aaaiss.v9i1.42919

Issue

Section

AI-Driven Resilience: Building Robust, Adaptive Technologies for a Dynamic World (Short Papers)