Balancing Accuracy and Efficiency in Multi-Turn Intent Classification for LLM-Powered Dialog Systems in Production

Authors

  • Junhua Liu Forth AI
  • Yong Keat Tan Shopee
  • Bin Fu Shopee
  • Kwan Hui Lim Singapore University of Technology and Design

DOI:

https://doi.org/10.1609/aaai.v40i47.41442

Abstract

Accurate multi-turn intent classification is critical for advancing conversational AI systems but remains challenging due to limited datasets and complex contextual dependencies across dialogue turns. This paper presents two novel approaches leveraging Large Language Models (LLMs) to enhance scalability and reduce latency in production dialogue systems. First, we introduce Symbol Tuning, which simplifies intent labels to reduce task complexity and improve performance in multi-turn dialogues. Second, we propose Consistency-aware, Linguistics Adaptive Retrieval Augmentation (CLARA), a framework that employs LLMs for data augmentation and pseudo-labeling to generate synthetic multi-turn dialogues. These enriched datasets are used to fine-tune a small, efficient model suitable for deployment. Experiments on multilingual dialogue datasets show that our methods result in notable gains in both accuracy and resource efficiency, with improvements of 5.09% in classification accuracy, a 40% reduction in annotation costs, and effective deployment in low-resource multilingual industrial settings.

Published

2026-03-14

How to Cite

Liu, J., Tan, Y. K., Fu, B., & Lim, K. H. (2026). Balancing Accuracy and Efficiency in Multi-Turn Intent Classification for LLM-Powered Dialog Systems in Production. Proceedings of the AAAI Conference on Artificial Intelligence, 40(47), 40073–40082. https://doi.org/10.1609/aaai.v40i47.41442

Issue

Section

IAAI Technical Track on Deployed Highly Innovative Applications of AI