Learning Multiple Tasks with Multilinear Relationship Networks
Mingsheng Long, Zhangjie Cao, Jianmin Wang, Philip S. Yu
TL;DR
The paper tackles multi-task learning in deep networks by explicitly modeling relationships across tasks, features, and classes. It introduces Multilinear Relationship Networks (MRN), which place tensor normal priors on task-specific layer parameters to learn high-order, multilinear relationships via Kronecker-structured covariances and a MAP training objective. An alternating optimization strategy updates network weights and covariance factors, enabling joint learning of transferable features and task relationships. Empirical results on three benchmarks show MRN achieving state-of-the-art accuracy, with visualization confirming meaningful task covariances and improved feature transferability. The approach offers a flexible, scalable framework that generalizes across backbones and task settings, addressing both negative-transfer in feature layers and under-transfer in classifiers.
Abstract
Deep networks trained on large-scale data can learn transferable features to promote learning multiple tasks. Since deep features eventually transition from general to specific along deep networks, a fundamental problem of multi-task learning is how to exploit the task relatedness underlying parameter tensors and improve feature transferability in the multiple task-specific layers. This paper presents Multilinear Relationship Networks (MRN) that discover the task relationships based on novel tensor normal priors over parameter tensors of multiple task-specific layers in deep convolutional networks. By jointly learning transferable features and multilinear relationships of tasks and features, MRN is able to alleviate the dilemma of negative-transfer in the feature layers and under-transfer in the classifier layer. Experiments show that MRN yields state-of-the-art results on three multi-task learning datasets.
