Domain Adaptation in Intent Classification Systems: A Review
Jesse Atuhurra, Hidetaka Kamigaito, Taro Watanabe, Eric Nichols
TL;DR
This paper provides a structured survey of domain adaptation in intent classification for dialogue systems, detailing the landscape of datasets, languages, and domains used to train and evaluate ICS. It synthesizes three core methodological approaches—fine-tuning pretrained language models, prompting, and few-shot/zero-shot learning—and highlights representative techniques and datasets that illustrate current capabilities and gaps. The authors identify key limitations in domain adaptation, such as dataset biases, monolingual focus, and the need for multimodal inputs, and offer concrete directions like multilingual, multimodal datasets, conversational pretraining, adapters, and contrastive learning to address these challenges. The work aims to guide future dataset creation and method development to enable robust, domain-diverse, and scalable intent classification in real-world dialogue systems.
Abstract
Dialogue agents, which perform specific tasks, are part of the long-term goal of NLP researchers to build intelligent agents that communicate with humans in natural language. Such systems should adapt easily from one domain to another to assist users in completing tasks. Researchers have developed a broad range of techniques, objectives, and datasets for intent classification to achieve such systems. Despite the progress in developing intent classification systems (ICS), a systematic review of the progress from a technical perspective is yet to be conducted. In effect, important implementation details of intent classification remain restricted and unclear, making it hard for natural language processing (NLP) researchers to develop new methods. To fill this gap, we review contemporary works in intent classification. Specifically, we conduct a thorough technical review of the datasets, domains, tasks, and methods needed to train the intent classification part of dialogue systems. Our structured analysis describes why intent classification is difficult and studies the limitations to domain adaptation while presenting opportunities for future work.
