Structured Output Regularization: a framework for few-shot transfer learning
Nicolas Ewen, Jairo Diaz-Rodriguez, Kelly Ramsay
TL;DR
Structured Output Regularization (SOR) addresses the challenge of adapting pretrained models with very limited labeled data by freezing internal structures and regulating their outputs rather than unfreezing weights. By inserting trainable beta weights between frozen blocks and applying $L_1$ penalties on these weights together with group-lasso penalties on block inputs, SOR enables targeted pruning and task-tailored adaptation with a minimal parameter budget. The authors provide implementations for convolutional layers, DenseNet blocks, and EfficientNet blocks, and demonstrate competitive few-shot performance on three medical-imaging tasks against established baselines, while enabling data-driven pruning. This approach offers a practical, data-efficient transfer-learning paradigm with broad applicability to CNN-based architectures and potential extensions to transformers, contributing to more parameter-efficient and robust domain adaptation in medical imaging and beyond.
Abstract
Traditional transfer learning typically reuses large pre-trained networks by freezing some of their weights and adding task-specific layers. While this approach is computationally efficient, it limits the model's ability to adapt to domain-specific features and can still lead to overfitting with very limited data. To address these limitations, we propose Structured Output Regularization (SOR), a simple yet effective framework that freezes the internal network structures (e.g., convolutional filters) while using a combination of group lasso and $L_1$ penalties. This framework tailors the model to specific data with minimal additional parameters and is easily applicable to various network components, such as convolutional filters or various blocks in neural networks enabling broad applicability for transfer learning tasks. We evaluate SOR on three few shot medical imaging classification tasks and we achieve competitive results using DenseNet121, and EfficientNetB4 bases compared to established benchmarks.
