Steinmetz Neural Networks for Complex-Valued Data
Shyam Venkatasubramanian, Ali Pezeshki, Vahid Tarokh
TL;DR
The paper introduces Steinmetz Neural Networks, a real-valued, multi-view architecture that processes complex-valued data via separate real and imaginary subnetworks before joint fusion, aiming for interpretable latent representations and improved generalization. It formalizes a consistency constraint linking the two latent views, and proposes the Analytic Neural Network by enforcing a Hilbert-transform-based relationship to induce orthogonality between latent components. An information-theoretic analysis suggests that such consistency can tighten generalization bounds, and the Hilbert consistency penalty is presented as a practical implementation. Empirical results on CV-MNIST, CV-CIFAR, CV-FSDD, and RASPNet show that Steinmetz and Analytic networks outperform traditional RVNNs and CVNNs, with Analytic achieving the strongest performance and enhanced noise robustness. The work highlights a promising approach for efficient, robust complex-valued data processing with potential for broader applicability and theoretical guarantees.
Abstract
We introduce a new approach to processing complex-valued data using DNNs consisting of parallel real-valued subnetworks with coupled outputs. Our proposed class of architectures, referred to as Steinmetz Neural Networks, incorporates multi-view learning to construct more interpretable representations in the latent space. Moreover, we present the Analytic Neural Network, which incorporates a consistency penalty that encourages analytic signal representations in the latent space of the Steinmetz neural network. This penalty enforces a deterministic and orthogonal relationship between the real and imaginary components. Using an information-theoretic construction, we demonstrate that the generalization gap upper bound posited by the analytic neural network is lower than that of the general class of Steinmetz neural networks. Our numerical experiments depict the improved performance and robustness to additive noise, afforded by our proposed networks on benchmark datasets and synthetic examples.
