Learned Initializations for Optimizing Coordinate-Based Neural Representations
Matthew Tancik, Ben Mildenhall, Terrance Wang, Divi Schmidt, Pratul P. Srinivasan, Jonathan T. Barron, Ren Ng
TL;DR
This work tackles the inefficiency of optimizing coordinate-based neural representations for each new signal by applying optimization-based meta-learning to learn a favorable initial weight $\theta_0^*$ for a given signal class. Using MAML or Reptile, the method learns a data-driven prior that enables faster test-time convergence and improved generalization when observations are limited. Empirical results across 2D image regression, CT reconstruction, and 3D view synthesis (ShapeNet and Phototourism) show substantial speedups and better reconstructions, including single-view 3D recovery and appearance transfer. The approach is simple to implement within existing test-time optimization pipelines and provides a principled way to inject class-specific priors into neural representations without changing network architecture."
Abstract
Coordinate-based neural representations have shown significant promise as an alternative to discrete, array-based representations for complex low dimensional signals. However, optimizing a coordinate-based network from randomly initialized weights for each new signal is inefficient. We propose applying standard meta-learning algorithms to learn the initial weight parameters for these fully-connected networks based on the underlying class of signals being represented (e.g., images of faces or 3D models of chairs). Despite requiring only a minor change in implementation, using these learned initial weights enables faster convergence during optimization and can serve as a strong prior over the signal class being modeled, resulting in better generalization when only partial observations of a given signal are available. We explore these benefits across a variety of tasks, including representing 2D images, reconstructing CT scans, and recovering 3D shapes and scenes from 2D image observations.
