Bisecting K-Means in RAG for Enhancing Question-Answering Tasks Performance in Telecommunications
Pedro Sousa, Cláudio Klautau Mello, Frank B. Morte, Luis F. Solis Navarro
TL;DR
This work tackles question-answering in the telecommunications domain by integrating a routing module with Bisecting K-Means clustering into a Retrieval-Augmented Generation pipeline. It processes 3GPP-release documents into content-based clusters, enabling efficient subset retrieval via a vector database and a top-K routing strategy, while fine-tuning a small phi-2 model on RAG-context data. Empirical results show accuracy improvements of 66.12% on phi-2 and 72.13% on phi-3 with reduced training times, and the approach maintains potential applicability to larger LLMs. The work contributes a practical, scalable framework for domain-specific QA in fast-moving standards ecosystems.
Abstract
Question-answering tasks in the telecom domain are still reasonably unexplored in the literature, primarily due to the field's rapid changes and evolving standards. This work presents a novel Retrieval-Augmented Generation framework explicitly designed for the telecommunication domain, focusing on datasets composed of 3GPP documents. The framework introduces the use of the Bisecting K-Means clustering technique to organize the embedding vectors by contents, facilitating more efficient information retrieval. By leveraging this clustering technique, the system pre-selects a subset of clusters that are most similar to the user's query, enhancing the relevance of the retrieved information. Aiming for models with lower computational cost for inference, the framework was tested using Small Language Models, demonstrating improved performance with an accuracy of 66.12% on phi-2 and 72.13% on phi-3 fine-tuned models, and reduced training time.
