Pseudo-Invertible Neural Networks
Yamit Ehrlich, Nimrod Berman, Assaf Shocher
TL;DR
This work generalizes the Moore-Penrose pseudo-inverse to non-linear mappings by enforcing the first two Penrose identities and using bijective completion to select a unique non-linear PInv. It introduces Surjective PInv Neural Networks (SPNNs), a dimension-reducing architecture with a built-in PInv guaranteed by a two-phase training scheme and an auxiliary inverse network. The framework enables Non-Linear Back-Projection (NLBP), unifying linear null-space projection and gradient-based methods to perform zero-shot restoration of non-linear degradations, including semantic and attribute-level control in diffusion-generated images. By integrating SPNNs with a diffusion prior, the approach achieves stable, semantically constrained inversion without retraining the prior, with demonstrated capability on semantic reconstruction and attribute-controlled generation. The work provides a principled mathematical bridge for solving complex non-linear inverse problems in scientific computing and image synthesis.
Abstract
The Moore-Penrose Pseudo-inverse (PInv) serves as the fundamental solution for linear systems. In this paper, we propose a natural generalization of PInv to the nonlinear regime in general and to neural networks in particular. We introduce Surjective Pseudo-invertible Neural Networks (SPNN), a class of architectures explicitly designed to admit a tractable non-linear PInv. The proposed non-linear PInv and its implementation in SPNN satisfy fundamental geometric properties. One such property is null-space projection or "Back-Projection", $x' = x + A^\dagger(y-Ax)$, which moves a sample $x$ to its closest consistent state $x'$ satisfying $Ax=y$. We formalize Non-Linear Back-Projection (NLBP), a method that guarantees the same consistency constraint for non-linear mappings $f(x)=y$ via our defined PInv. We leverage SPNNs to expand the scope of zero-shot inverse problems. Diffusion-based null-space projection has revolutionized zero-shot solving for linear inverse problems by exploiting closed-form back-projection. We extend this method to non-linear degradations. Here, "degradation" is broadly generalized to include any non-linear loss of information, spanning from optical distortions to semantic abstractions like classification. This approach enables zero-shot inversion of complex degradations and allows precise semantic control over generative outputs without retraining the diffusion prior.
