[1] T. B. Abdur Rakib, A. Mehrish, L.-K. Soon, W. H. Lim, and S. Poria, “DialogXpert: Driving Intelligent and Emotion-Aware Conversations Through Online Value-Based Reinforcement Learning with LLM Priors,” AAAI, vol. 40, no. 36, pp. 29967–29975, Mar. 2026.