E-Logic Prompt: Unified Energy-Logic Framework for Continual Visual Question Answering

Authors

  • Jiayao Tan School of Artificial Intelligence,Tianjin University, Suzhou University of Science and Technology, Key Research Center for Surface Monitoring and Analysis of Relics, State Administration of Cultural Heritage
  • Tianle Liu Suzhou University of Science and Technology
  • Fuyuan Hu Suzhou University of Science and Technology
  • Wei Feng School of Computer Science,Tianjin University Key Research Center for Surface Monitoring and Analysis of Relics, State Administration of Cultural Heritage
  • Liang Wan School of Software,Tianjin University Key Research Center for Surface Monitoring and Analysis of Relics, State Administration of Cultural Heritage

DOI:

https://doi.org/10.1609/aaai.v40i30.39775

Abstract

Prompt tuning has shown promise for continual visual question answering (CVQA), facilitating modular and transferable knowledge across tasks. However, existing approaches often overlook the guiding role of prompts in the model’s implicit reasoning process. This oversight can lead to inconsistent reasoning paths and performance degradation across tasks. To address this issue, we propose the E Logic Prompt framework, which employs energy-based models (EBMs) to model the semantic compatibility between prompts and queries. In this framework, prompts function not only as adapters but also as reasoning guides that help maintain coherence throughout the inference process. The framework enforces logical consistency at three levels. At the input level, it selects semantically aligned prompts by minimizing the energy between queries and prompts. Within the model, it aligns intermediate representations with prompts across layers to preserve step-by-step reasoning. Across tasks, it applies energy-based constraints to regulate prompt behavior, effectively suppressing semantic drift and enabling prompt reuse. These three levels of consistency together enhance the guiding capacity of prompts, allowing them to steer the model toward more stable and coherent reasoning. Extensive experiments show that E Logic Prompt outperforms existing methods in both accuracy and knowledge retention, while effectively maintaining balanced cross-modal reasoning throughout continual learning.

Published

2026-03-14

How to Cite

Tan, J., Liu, T., Hu, F., Feng, W., & Wan, L. (2026). E-Logic Prompt: Unified Energy-Logic Framework for Continual Visual Question Answering. Proceedings of the AAAI Conference on Artificial Intelligence, 40(30), 25770–25777. https://doi.org/10.1609/aaai.v40i30.39775

Issue

Section

AAAI Technical Track on Machine Learning VII