Trigger$^3$: Refining Query Correction via Adaptive Model Selector
Kepu Zhang, Zhongxiang Sun, Xiao Zhang, Xiaoxue Zang, Kai Zheng, Yang Song, Jun Xu
TL;DR
This work tackles the challenge of correcting erroneous search queries by combining small, efficient correction models with powerful LLMs. It introduces Trigger$^3$, a three-trigger framework that adaptively routes queries through a Correction Trigger, an LLM Trigger, and a Fallback Trigger to balance accuracy and efficiency. Empirical results on Commercial and QQ datasets show Trigger$^3$ outperforms standalone small models and LLM-based baselines, while maintaining high efficiency due to measured LLM usage. The approach advances practical query correction by leveraging model complementarities and principled decision mechanisms, with potential impact on production search systems.
Abstract
In search scenarios, user experience can be hindered by erroneous queries due to typos, voice errors, or knowledge gaps. Therefore, query correction is crucial for search engines. Current correction models, usually small models trained on specific data, often struggle with queries beyond their training scope or those requiring contextual understanding. While the advent of Large Language Models (LLMs) offers a potential solution, they are still limited by their pre-training data and inference cost, particularly for complex queries, making them not always effective for query correction. To tackle these, we propose Trigger$^3$, a large-small model collaboration framework that integrates the traditional correction model and LLM for query correction, capable of adaptively choosing the appropriate correction method based on the query and the correction results from the traditional correction model and LLM. Trigger$^3$ first employs a correction trigger to filter out correct queries. Incorrect queries are then corrected by the traditional correction model. If this fails, an LLM trigger is activated to call the LLM for correction. Finally, for queries that no model can correct, a fallback trigger decides to return the original query. Extensive experiments demonstrate Trigger$^3$ outperforms correction baselines while maintaining efficiency.
