SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of Images
Nir Barel, Ron Shapira Weber, Nir Mualem, Shahaf E. Finder, Oren Freifeld
TL;DR
SpaceJAM tackles unsupervised joint alignment (JA) by eliminating the need for atlas maintenance and regularization, delivering a lightweight 16K-parameter model that trains and runs an order of magnitude faster than prior methods. It achieves this with a novel inverse-compositional loss built on a Lie-algebra–parameterized sequence of small, invertible warps within a shared latent feature space, plus a compact preprocessing pipeline (PCA+autoencoder) to produce $U_i$ representations. Key contributions include the Lie-group–aware IC-STN architecture, a curriculum that transitions from $\mathrm{SE}(2)$ to full homographies, and an effective flips-handling mechanism, all of which yield competitive PCK@0.10 scores on SPair-71K and CUB while offering substantial speedups and reduced parameter count. The approach enables efficient, robust JA suitable for weakly supervised settings and broad applicability, with code and models made publicly available.
Abstract
The unsupervised task of Joint Alignment (JA) of images is beset by challenges such as high complexity, geometric distortions, and convergence to poor local or even global optima. Although Vision Transformers (ViT) have recently provided valuable features for JA, they fall short of fully addressing these issues. Consequently, researchers frequently depend on expensive models and numerous regularization terms, resulting in long training times and challenging hyperparameter tuning. We introduce the Spatial Joint Alignment Model (SpaceJAM), a novel approach that addresses the JA task with efficiency and simplicity. SpaceJAM leverages a compact architecture with only 16K trainable parameters and uniquely operates without the need for regularization or atlas maintenance. Evaluations on SPair-71K and CUB datasets demonstrate that SpaceJAM matches the alignment capabilities of existing methods while significantly reducing computational demands and achieving at least a 10x speedup. SpaceJAM sets a new standard for rapid and effective image alignment, making the process more accessible and efficient. Our code is available at: https://bgu-cs-vil.github.io/SpaceJAM/.
