Universal Model in Online Customer Service
Shu-Ting Pi, Cheng-Ping Hsieh, Qun Liu, Yuying Zhu
TL;DR
The paper tackles the retraining burden in online customer service by introducing a universal, training-free retrieval-based approach. It builds a question-label repository using a SeaCat-based sentence tagging model that extracts each transcript’s main customer question, then uses a BM25-style retrieval to predict both categorical (e.g., product/service) and continuous (e.g., handling time) labels from the Top-$N$ similar questions. The approach is evaluated against a Random Forest baseline and shown to improve with larger repositories, outperforming supervised methods in several settings. This method enables rapid deployment and adaptation to new products or events, reducing model development time and operational costs while maintaining predictive performance.
Abstract
Building machine learning models can be a time-consuming process that often takes several months to implement in typical business scenarios. To ensure consistent model performance and account for variations in data distribution, regular retraining is necessary. This paper introduces a solution for improving online customer service in e-commerce by presenting a universal model for predict-ing labels based on customer questions, without requiring training. Our novel approach involves using machine learning techniques to tag customer questions in transcripts and create a repository of questions and corresponding labels. When a customer requests assistance, an information retrieval model searches the repository for similar questions, and statistical analysis is used to predict the corresponding label. By eliminating the need for individual model training and maintenance, our approach reduces both the model development cycle and costs. The repository only requires periodic updating to maintain accuracy.
