Beyond IID: Optimizing Instruction Learning from the Perspective of Instruction Interaction and Dependency

Hanyu Zhao; Li Du; Yiming Ju; Chengwei Wu; Tengfei Pan

Beyond IID: Optimizing Instruction Learning from the Perspective of Instruction Interaction and Dependency

Hanyu Zhao, Li Du, Yiming Ju, Chengwei Wu, Tengfei Pan

TL;DR

This paper systemically investigates interaction and dependency patterns between different categories of instructions, manages to optimize the instruction set concerning the interaction patterns using a linear programming-based method, and optimize the learning schema of SFT using an instruction dependency taxonomy guided curriculum learning.

Abstract

With the availability of various instruction datasets, a pivotal challenge is how to effectively select and integrate these instructions to fine-tune large language models (LLMs). Previous research mainly focuses on selecting individual high-quality instructions. However, these works overlooked the joint interactions and dependencies between different categories of instructions, leading to suboptimal selection strategies. Moreover, the nature of these interaction patterns remains largely unexplored, let alone optimize the instruction set with regard to them. To fill these gaps, in this paper, we: (1) systemically investigate interaction and dependency patterns between different categories of instructions, (2) manage to optimize the instruction set concerning the interaction patterns using a linear programming-based method, and optimize the learning schema of SFT using an instruction dependency taxonomy guided curriculum learning. Experimental results across different LLMs demonstrate improved performance over strong baselines on widely adopted benchmarks.

Beyond IID: Optimizing Instruction Learning from the Perspective of Instruction Interaction and Dependency

TL;DR

Abstract

Paper Structure (28 sections, 5 equations, 7 figures, 6 tables)

This paper contains 28 sections, 5 equations, 7 figures, 6 tables.

Introduction
Causal Intervention based Instruction Correlation Analysis and Ability Taxonomy Induction
Instruction Collection and Automatic Ability Tagging System
Causal Intervention based Instruction Correlation Analysis
Experimental Settings
Analysis Results
Causal Intervention based Large Language Model Ability Taxonomy Induction
Experimental Settings
An Empirical Ability Taxonomy of LLM
Category Relationship Guided Instruction Set Optimization
Effect Equivalence-based Category Proportion Optimization
Experimental Settings
Baseline Methods
Results
Ability Dependency Taxonomy Guided Curriculum Instruction Learning
...and 13 more sections

Figures (7)

Figure 1: Framework of our work. Baseline methods selection instructions using quality scores (a). In this paper, we first induce the correlation pattern (b) and dependency taxonomy (d), then optimize the instruction set collection concerning the correlation (c) and dependency taxonomy (e).
Figure 2: The effect equivalence coefficients between different categories of instructions.
Figure 3: Performance of Llama3-8B and Qwen 1.5-7B fine-tuned on instruction set obtained by EE-CPO and DEITA with different sample sizes.
Figure 4: Performance of Llama3-8B and Qwen 1.5-7B fine-tuned on instruction set obtained by DF-CSFT and DEITA with different sample sizes.
Figure 5: The effect equivalence coefficients between different categories of instructions derived by Qwen and Llama, respectively.
...and 2 more figures

Beyond IID: Optimizing Instruction Learning from the Perspective of Instruction Interaction and Dependency

TL;DR

Abstract

Beyond IID: Optimizing Instruction Learning from the Perspective of Instruction Interaction and Dependency

Authors

TL;DR

Abstract

Table of Contents

Figures (7)