KaLM: Knowledge-aligned Autoregressive Language Modeling via Dual-view Knowledge Graph Contrastive Learning
Peng Yu, Cheng Deng, Beiya Dai, Xinbing Wang, Ying Wen
TL;DR
KaLM tackles knowledge deficiencies in autoregressive LLMs by jointly aligning them with knowledge graphs through explicit dual-view contrastive learning and implicit triple-completion language modeling. The final objective, $\mathcal{L}_{KaLM} = \mathcal{L}_{exp} + \lambda \cdot \mathcal{L}_{imp}$, enables robust knowledge representation while preserving generation. Theoretical results show that dual-view contrastive learning promotes knowledge alignment and mitigates representation anisotropy, and experiments demonstrate significant gains in embedding-based KG completion and generation-based KGQA across multiple LLMs when fine-tuned with KaLM. The approach yields more discriminative, uniform knowledge representations and improves KG reasoning without sacrificing language modeling capabilities, suggesting practical benefits for retrieval-augmented and cross-domain knowledge tasks.
Abstract
Autoregressive large language models (LLMs) pre-trained by next token prediction are inherently proficient in generative tasks. However, their performance on knowledge-driven tasks such as factual knowledge querying remains unsatisfactory. Knowledge graphs (KGs), as high-quality structured knowledge bases, can provide reliable knowledge for LLMs, potentially compensating for their knowledge deficiencies. Aligning LLMs with explicit, structured knowledge from KGs has been a challenge; previous attempts either failed to effectively align knowledge representations or compromised the generative capabilities of LLMs, leading to less-than-optimal outcomes. This paper proposes \textbf{KaLM}, a \textit{Knowledge-aligned Language Modeling} approach, which fine-tunes autoregressive LLMs to align with KG knowledge via the joint objective of explicit knowledge alignment and implicit knowledge alignment. The explicit knowledge alignment objective aims to directly optimize the knowledge representation of LLMs through dual-view knowledge graph contrastive learning. The implicit knowledge alignment objective focuses on incorporating textual patterns of knowledge into LLMs through triple completion language modeling. Notably, our method achieves a significant performance boost in evaluations of knowledge-driven tasks, specifically embedding-based knowledge graph completion and generation-based knowledge graph question answering.
