Efficient Multi-Task Modeling through Automated Fusion of Trained Models
Jingxuan Zhou, Weidong Bao, Ji Wang, Zhengyi Zhong, Dayu Zhang
TL;DR
The paper addresses the inefficiency of traditional multi-task learning (MTL), which relies on design choices customized to each task combination. It introduces Efficient Multi-Task Modeling (EMM), which automatically fuses trained single-task models by decomposing them hierarchically into components and applying an Adaptive Knowledge Fusion (AKF) module. Each AKF module performs MoE-based fusion within a task and self-attention-based fusion across tasks. By stacking AKF modules, EMM constructs a multi-task model automatically and accommodates heterogeneous architectures and task relationships. Experiments on Census-Income, Ali-CCP, and AliExpress show that EMM performs competitively with or better than the baselines while reducing manual design effort, which supports its practical value for scalable, automated multi-task modeling.
Abstract
Although multi-task learning is widely applied in intelligent services, traditional multi-task modeling methods often require customized designs based on specific task combinations, resulting in a cumbersome modeling process. Inspired by the rapid development and excellent performance of single-task models, this paper proposes an efficient multi-task modeling method that can automatically fuse trained single-task models with different structures and tasks to form a multi-task model. As a general framework, this method allows modelers to simply prepare trained models for the required tasks, simplifying the modeling process while fully utilizing the knowledge contained in the trained models. This eliminates the need for excessive focus on task relationships and model structure design. To achieve this goal, we consider the structural differences among various trained models and employ model decomposition techniques to hierarchically decompose them into multiple operable model components. Furthermore, we have designed an Adaptive Knowledge Fusion (AKF) module based on Transformer, which adaptively integrates intra-task and inter-task knowledge based on model components. Through the proposed method, we achieve efficient and automated construction of multi-task models, and its effectiveness is verified through extensive experiments on three datasets.
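To make the two fusion steps concrete, the following is a minimal sketch of one AKF-style layer in plain Python. It is not the authors' implementation. Several details are assumptions made for illustration only: the component outputs of each task are taken to be already projected to a common dimension, the intra-task gate is a softmax over linear scores of the input, and the inter-task step is single-head scaled dot-product self-attention without learned projections. All function and parameter names are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def weighted_sum(weights, vectors):
    """Return sum_i weights[i] * vectors[i] for equal-length vectors."""
    dim = len(vectors[0])
    return [sum(w * v[i] for w, v in zip(weights, vectors)) for i in range(dim)]


def intra_task_fusion(component_outputs, gate_weights, x):
    """MoE-style fusion within one task (assumed form).

    component_outputs: one vector per decomposed model component.
    gate_weights: one weight vector per component, scored against input x.
    """
    gates = softmax([dot(g, x) for g in gate_weights])
    return weighted_sum(gates, component_outputs), gates


def inter_task_fusion(task_vectors):
    """Self-attention across tasks (assumed: single head, no projections)."""
    scale = math.sqrt(len(task_vectors[0]))
    fused = []
    for query in task_vectors:
        attention = softmax([dot(query, key) / scale for key in task_vectors])
        fused.append(weighted_sum(attention, task_vectors))
    return fused


def akf_layer(per_task_components, per_task_gates, x):
    """One layer: fuse within each task, then exchange knowledge across tasks."""
    task_vectors = [
        intra_task_fusion(components, gates, x)[0]
        for components, gates in zip(per_task_components, per_task_gates)
    ]
    return inter_task_fusion(task_vectors)


if __name__ == "__main__":
    x = [0.5, -1.0]  # shared input features (toy values)
    components = [
        [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]],  # task A: two components
        [[0.0, 0.0, 1.0], [1.0, 1.0, 1.0]],  # task B: two components
    ]
    gates = [
        [[0.2, 0.1], [-0.3, 0.4]],
        [[0.0, 0.0], [0.0, 0.0]],
    ]
    for task_output in akf_layer(components, gates, x):
        print(task_output)
```

Stacking, as the abstract describes it, would correspond to feeding each layer's per-task outputs into the next layer as new component representations. How the actual method wires successive layers is not specified here, so that step is left out of the sketch.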
