RedEx: Beyond Fixed Representation Methods via Convex Optimization
Amit Daniely, Mariano Schain, Gilad Yehudai
TL;DR
This work addresses the gap between the optimization guarantees of fixed representations and the expressive power of neural networks by introducing RedEx, a Reduced Extractor Expander architecture. RedEx can match neural networks in expressiveness while enabling layerwise training via convex semidefinite programs, providing provable guarantees without assumptions on the input distribution. A key contribution is a separation result showing RedEx can efficiently learn a family of functions that fixed representation methods cannot, along with an efficient training algorithm and extensions to multi-layer and convolutional forms. The norm-based formulation and convolutional extensions further broaden applicability, suggesting a path toward scalable, provably learnable representation learning outside standard SDP frameworks.
Abstract
Optimizing Neural networks is a difficult task which is still not well understood. On the other hand, fixed representation methods such as kernels and random features have provable optimization guarantees but inferior performance due to their inherent inability to learn the representations. In this paper, we aim at bridging this gap by presenting a novel architecture called RedEx (Reduced Expander Extractor) that is as expressive as neural networks and can also be trained in a layer-wise fashion via a convex program with semi-definite constraints and optimization guarantees. We also show that RedEx provably surpasses fixed representation methods, in the sense that it can efficiently learn a family of target functions which fixed representation methods cannot.
