Measuring Fine-Grained Relatedness in Multitask Learning via Data Attribution

Yiwen Tu; Ziqi Liu; Jiaqi W. Ma; Weijing Tang

TL;DR

This work extends data attribution, which quantifies the influence of individual training data points on model predictions, to the multitask learning (MTL) setting, offering an efficient and fine-grained way to measure task relatedness and improve MTL models.

Abstract

Measuring task relatedness and mitigating negative transfer remain critical open challenges in Multitask Learning (MTL). This work extends data attribution, which quantifies the influence of individual training data points on model predictions, to the MTL setting for measuring task relatedness. We propose the MultiTask Influence Function (MTIF), a method that adapts influence functions to MTL models with hard or soft parameter sharing. Compared to conventional task relatedness measurements, MTIF provides a fine-grained, instance-level relatedness measure that goes beyond the entire-task level. This fine-grained measure enables a data selection strategy that effectively mitigates negative transfer in MTL. Through extensive experiments, we demonstrate that the proposed MTIF efficiently and accurately approximates the performance of models trained on data subsets. Moreover, the data selection strategy enabled by MTIF consistently improves model performance in MTL. Our work establishes a novel connection between data attribution and MTL, offering an efficient and fine-grained solution for measuring task relatedness and enhancing MTL models.
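For readers unfamiliar with influence functions, the following is a minimal sketch of the classical, single-task idea that the abstract says MTIF adapts. It is not the MTIF method itself: it assumes plain ridge regression with squared loss, and every name in it (`fit_ridge`, `heldout_loss`, `influence_of_removal`, the synthetic data) is hypothetical and chosen only for illustration. The sketch estimates how the held-out loss would change if each training point were removed, without retraining, and then compares that estimate against actual leave-one-out retraining.

```python
import numpy as np

def fit_ridge(X, y, lam):
    """Minimise sum_j 0.5*(x_j^T theta - y_j)^2 + 0.5*lam*||theta||^2."""
    d = X.shape[1]
    H = X.T @ X + lam * np.eye(d)        # Hessian of the training objective
    theta = np.linalg.solve(H, X.T @ y)  # closed-form minimiser
    return theta, H

def heldout_loss(theta, X_eval, y_eval):
    """Mean squared-error loss (with the 0.5 factor) on held-out data."""
    residual = X_eval @ theta - y_eval
    return 0.5 * np.mean(residual ** 2)

def influence_of_removal(X, y, X_eval, y_eval, lam):
    """First-order estimate of the change in held-out loss when each
    training point is removed: grad_eval^T H^{-1} grad_i."""
    theta, H = fit_ridge(X, y, lam)
    # Per-example training gradients, one row per training point: shape (n, d).
    train_grads = X * (X @ theta - y)[:, None]
    # Gradient of the mean held-out loss: shape (d,).
    eval_grad = X_eval.T @ (X_eval @ theta - y_eval) / len(y_eval)
    # H is symmetric, so solving once against eval_grad serves all points.
    return train_grads @ np.linalg.solve(H, eval_grad)

# Synthetic data (an assumption for illustration only).
rng = np.random.default_rng(0)
n, d, lam = 200, 5, 1.0
w_true = rng.normal(size=d)
X = rng.normal(size=(n, d))
y = X @ w_true + 0.5 * rng.normal(size=n)
X_eval = rng.normal(size=(50, d))
y_eval = X_eval @ w_true + 0.5 * rng.normal(size=50)

predicted = influence_of_removal(X, y, X_eval, y_eval, lam)

# Ground truth: retrain with each point left out and measure the change.
theta_full, _ = fit_ridge(X, y, lam)
base = heldout_loss(theta_full, X_eval, y_eval)
actual = np.array([
    heldout_loss(fit_ridge(np.delete(X, i, axis=0), np.delete(y, i), lam)[0],
                 X_eval, y_eval) - base
    for i in range(n)
])

corr = np.corrcoef(predicted, actual)[0, 1]
print(f"correlation between predicted and actual change: {corr:.4f}")
```

A positive score suggests that removing the point would raise the held-out loss (the point is helpful), and a negative score suggests the opposite. Scoring individual training points in this way is the kind of instance-level signal the abstract describes using for data selection, here shown only in the single-task case.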
