Function Encoders: A Principled Approach to Transfer Learning in Hilbert Spaces
Tyler Ingebrand, Adam J. Thorpe, Ufuk Topcu
TL;DR
This work introduces a geometric framework for inductive transfer in a Hilbert space $\mathcal{H}$, defining three transfer types: interpolation within the convex hull $C_h$, extrapolation to the linear span $\operatorname{span}\{f_{S_i}\}$, and extrapolation to the full space $\mathcal{H}$. It proposes function encoders that learn a fixed neural-basis representation, enabling efficient online adaptation by expressing target tasks as linear combinations of basis functions with coefficients computed via least-squares, a method supported by a universal function space approximation theorem. The authors prove that any function in a separable Hilbert space can be approximated arbitrarily well by finite neural-basis combinations, and they generalize inner products to various output spaces. Empirically, FE(LS) matches or outperforms baselines across polynomial, CIFAR, 7-Scenes, and MuJoCo tasks, with especially strong performance on Type 2 and Type 3 transfer, highlighting the method's ability to transfer knowledge to unseen tasks without retraining.
Abstract
A central challenge in transfer learning is designing algorithms that can quickly adapt and generalize to new tasks without retraining. Yet, the conditions of when and how algorithms can effectively transfer to new tasks is poorly characterized. We introduce a geometric characterization of transfer in Hilbert spaces and define three types of inductive transfer: interpolation within the convex hull, extrapolation to the linear span, and extrapolation outside the span. We propose a method grounded in the theory of function encoders to achieve all three types of transfer. Specifically, we introduce a novel training scheme for function encoders using least-squares optimization, prove a universal approximation theorem for function encoders, and provide a comprehensive comparison with existing approaches such as transformers and meta-learning on four diverse benchmarks. Our experiments demonstrate that the function encoder outperforms state-of-the-art methods on four benchmark tasks and on all three types of transfer.
