Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching
Xiaoying Zhang, Baolin Peng, Ye Tian, Jingyan Zhou, Yipeng Zhang, Haitao Mi, Helen Meng
TL;DR
Self-Tuning presents a three-stage framework for teaching LLMs to acquire new knowledge from unseen raw documents, inspired by the Feynman Technique. A novel Self-Teaching strategy decomposes learning into memorization, comprehension, and self-reflection, with tasks designed to be self-supervised and document-driven. The Wiki-Newpages-2023-QA datasets enable rigorous evaluation of memorization, extraction, and reasoning across single-, multi-, and cross-domain settings. Empirical results across multiple models show that Self-Tuning consistently improves knowledge acquisition and retention while maintaining prior knowledge, and it demonstrates favorable training efficiency relative to baselines. The work also outlines avenues for integration with continual learning and broader evaluations, highlighting practical potential for up-to-date knowledge in LLMs.
Abstract
Large language models (LLMs) often struggle to provide up-to-date information due to their one-time training and the constantly evolving nature of the world. To keep LLMs current, existing approaches typically involve continued pre-training on new documents. However, they frequently face difficulties in extracting stored knowledge. Motivated by the remarkable success of the Feynman Technique in efficient human learning, we introduce Self-Tuning, a learning framework aimed at improving an LLM's ability to effectively acquire new knowledge from unseen raw documents through self-teaching. Specifically, we develop a Self-Teaching strategy that augments the documents with a set of knowledge-intensive tasks created in a self-supervised manner, focusing on three crucial aspects: memorization, comprehension, and self-reflection. Additionally, we introduce three Wiki-Newpages-2023-QA datasets to facilitate an in-depth analysis of an LLM's knowledge acquisition ability concerning memorization, extraction, and reasoning. Extensive experimental results on various models, e.g., Llama2-7B reveal that Self-Tuning consistently exhibits superior performance across all knowledge acquisition tasks and excels in preserving previous knowledge.
