Few-shot Transfer Learning for Knowledge Base Question Answering: Fusing Supervised Models with In-Context Learning
Mayur Patidar, Riya Sawhney, Avinash Singh, Biswajit Chatterjee, Mausam, Indrajit Bhattacharya
TL;DR
FuSIC-KBQA tackles few-shot transfer learning for Knowledge Base Question Answering by fusing multiple source-trained retrieval models with large-language-model in-context learning. The system retrieves KB context with complementary retrievers, uses LLMs to re-rank and generate SPARQL, and applies execution-guided feedback to refine queries. Across multiple source–target pairs and limited target supervision, FuSIC-KBQA significantly outperforms adapted state-of-the-art supervised and FS-ICL KBQA approaches, and it also shows advantages in in-domain, low-data settings. The approach reduces annotation costs and demonstrates robust cross-domain transfer and in-domain applicability, with potential to benefit from open-source LLMs and expanded retriever sets in future work.
Abstract
Existing Knowledge Base Question Answering (KBQA) architectures are hungry for annotated data, which make them costly and time-consuming to deploy. We introduce the problem of few-shot transfer learning for KBQA, where the target domain offers only a few labeled examples, but a large labeled training dataset is available in a source domain. We propose a novel KBQA architecture called FuSIC-KBQA that performs KB-retrieval using multiple source-trained retrievers, re-ranks using an LLM and uses this as input for LLM few-shot in-context learning to generate logical forms. These are further refined using execution-guided feedback. Experiments over multiple source-target KBQA pairs of varying complexity show that FuSIC-KBQA significantly outperforms adaptations of SoTA KBQA models for this setting. Additional experiments show that FuSIC-KBQA also outperforms SoTA KBQA models in the in-domain setting when training data is limited.
