KaFT: Knowledge-aware Fine-tuning for Boosting LLMs' Domain-specific Question-Answering Performance

Qihuang Zhong; Liang Ding; Xiantao Cai; Juhua Liu; Bo Du; Dacheng Tao

TL;DR

Supervised fine-tuning (SFT) for domain-specific QA often suffers from knowledge conflicts between an LLM's internal knowledge and its training data. The authors propose KaFT, a knowledge-aware fine-tuning framework that combines a conflict detector based on query diversification with sample-adaptive rewards that weight each training sample by its conflict level. This suppresses harmful training signals while still exploiting useful conflict data. Experiments on multiple LLMs (LLaMA3, Qwen, Mistral) and on medical, multilingual, and out-of-domain (OOD) benchmarks show consistent gains and reduced hallucination, with notable improvements in OOD robustness. The findings suggest that KaFT could generalize beyond medical QA to other domain-specific tasks, improving both the performance and the reliability of LLMs in specialized settings.

Abstract

Supervised fine-tuning (SFT) is a common approach to improving the domain-specific question-answering (QA) performance of large language models (LLMs). However, recent literature reveals that, due to conflicts between LLMs' internal knowledge and the context knowledge of the training data, vanilla SFT on the full QA training set is usually suboptimal. In this paper, we first design a query diversification strategy for robust conflict detection and then conduct a series of experiments to analyze the impact of knowledge conflict. We find that 1) training samples with different conflict levels contribute differently, and SFT on data with large conflicts leads to catastrophic performance drops; 2) compared to directly filtering out the conflict data, appropriately using it is more beneficial. Motivated by this, we propose a simple yet effective Knowledge-aware Fine-tuning approach (KaFT) to boost LLMs' performance. The core of KaFT is to adapt the training weight by assigning different rewards to different training samples according to their conflict level. Extensive experiments show that KaFT brings consistent and significant improvements across four LLMs. Further analyses show that KaFT effectively improves model generalization and alleviates hallucination.
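To make the idea of conflict-dependent training weights concrete, here is a minimal sketch in plain Python. It is not the authors' implementation, and every specific in it is an assumption for illustration. The conflict score is taken to be the fraction of diversified queries on which the model's answer disagrees with the gold answer. The thresholds (`low`, `high`), the reward values (1.0, 0.5, 0.1), and averaging over the batch size are hypothetical choices. The function names `conflict_score`, `reward_for`, and `weighted_sft_loss` are likewise invented here.

```python
from typing import List, Sequence


def conflict_score(answers: Sequence[str], gold: str) -> float:
    """Fraction of diversified-query answers that disagree with the gold answer.

    `answers` holds the model's answers to several rephrasings of one question
    (assumed to be collected elsewhere). 0.0 means no conflict, 1.0 full conflict.
    """
    if not answers:
        raise ValueError("need at least one model answer")
    target = gold.strip().lower()
    wrong = sum(1 for a in answers if a.strip().lower() != target)
    return wrong / len(answers)


def reward_for(score: float, low: float = 0.3, high: float = 0.7) -> float:
    """Map a conflict score to a training weight.

    Thresholds and reward values are hypothetical, chosen only to show
    the shape of a sample-adaptive weighting scheme.
    """
    if score <= low:
        return 1.0  # little conflict: train at full weight
    if score < high:
        return 0.5  # moderate conflict: keep the sample, but down-weight it
    return 0.1      # heavy conflict: suppress rather than discard


def weighted_sft_loss(token_log_probs: List[List[float]],
                      rewards: List[float]) -> float:
    """Reward-weighted SFT loss over a batch.

    Each sample contributes its mean token-level negative log-likelihood,
    scaled by its reward; the sum is divided by the batch size.
    """
    if len(token_log_probs) != len(rewards):
        raise ValueError("one reward per sample is required")
    total = 0.0
    for log_probs, reward in zip(token_log_probs, rewards):
        nll = -sum(log_probs) / len(log_probs)
        total += reward * nll
    return total / len(rewards)


# Usage: the model answers "A" on two of four rephrasings of a question
# whose gold answer is "A", so the sample is treated as moderately conflicting.
score = conflict_score(["A", "a ", "B", "C"], gold="A")
weight = reward_for(score)
loss = weighted_sft_loss([[-1.0, -1.0], [-2.0]], [1.0, weight])
```

Keeping a small non-zero weight for heavily conflicting samples, instead of dropping them, mirrors the abstract's second finding that using conflict data appropriately can be more beneficial than filtering it out.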
