Eigenfunction Extraction for Ordered Representation Learning
Burak Varıcı, Che-Ping Tsai, Ritabrata Ray, Nicholas M. Boffi, Pradeep Ravikumar
TL;DR
The paper tackles identifiability in representation learning by reframing learned features as ordered eigenfunctions of a contextual kernel $k_{XA}$ with associated operator $T_{XA}$. It proposes a modular eigenfunction extraction framework that can recover exact eigenfunctions and eigenvalues, not just an eigenspace, by combining base eigenspace extractors with sequential/joint nesting or Rayleigh–Ritz post-processing. The authors formalize desiderata (compatibility, exact decomposition, unconstrained optimization, efficiency) and show how two mainstream paradigms, LoRA (low-rank approximation) and Rayleigh quotient optimization, fit within this framework and connect to contrastive and non-contrastive learning. Through synthetic kernel experiments and image-representation tasks, they demonstrate that recovered eigenvalues provide meaningful feature-importance scores enabling adaptive-dimensional representations, with practical guidance on method choice (e.g., Rayleigh–Ritz for VICReg, joint nesting for SCL). Overall, the work offers a principled, scalable route to identifiable, ordered representations that support principled efficiency-accuracy tradeoffs in large-scale systems.
Abstract
Recent advances in representation learning reveal that widely used objectives, such as contrastive and non-contrastive, implicitly perform spectral decomposition of a contextual kernel, induced by the relationship between inputs and their contexts. Yet, these methods recover only the linear span of top eigenfunctions of the kernel, whereas exact spectral decomposition is essential for understanding feature ordering and importance. In this work, we propose a general framework to extract ordered and identifiable eigenfunctions, based on modular building blocks designed to satisfy key desiderata, including compatibility with the contextual kernel and scalability to modern settings. We then show how two main methodological paradigms, low-rank approximation and Rayleigh quotient optimization, align with this framework for eigenfunction extraction. Finally, we validate our approach on synthetic kernels and demonstrate on real-world image datasets that the recovered eigenvalues act as effective importance scores for feature selection, enabling principled efficiency-accuracy tradeoffs via adaptive-dimensional representations.
