Efficient Pareto Manifold Learning with Low-Rank Structure
Weiyu Chen, James T. Kwok
TL;DR
This work tackles the challenge of producing a scalable, continuous Pareto front for multi-task learning with many tasks. It introduces LORPMAN, which decomposes per-layer parameters into a shared main network plus multiple low-rank matrices, enabling efficient parameter sharing and task-specific adaptation, reinforced by orthogonal regularization. The approach is theoretically supported by a universal approximation-like theorem and empirically validated across datasets with varying task counts, showing improved hypervolume and parameter efficiency over state-of-the-art baselines. The method demonstrates strong performance gains, especially as the number of tasks grows, indicating practical benefits for large-scale multi-objective learning. The findings highlight LORPMAN as a flexible, efficient tool for continuous PF learning in complex, real-world MTL settings.
Abstract
Multi-task learning, which optimizes performance across multiple tasks, is inherently a multi-objective optimization problem. Various algorithms are developed to provide discrete trade-off solutions on the Pareto front. Recently, continuous Pareto front approximations using a linear combination of base networks have emerged as a compelling strategy. However, it suffers from scalability issues when the number of tasks is large. To address this issue, we propose a novel approach that integrates a main network with several low-rank matrices to efficiently learn the Pareto manifold. It significantly reduces the number of parameters and facilitates the extraction of shared features. We also introduce orthogonal regularization to further bolster performance. Extensive experimental results demonstrate that the proposed approach outperforms state-of-the-art baselines, especially on datasets with a large number of tasks.
