Physics-Inspired Binary Neural Networks: Interpretable Compression with Theoretical Guarantees
Arian Eamaz, Farhang Yeganegi, Mojtaba Soltanalian
TL;DR
The paper introduces Physics-Inspired Binary Neural Networks (PIBiNNs) that fuse one-bit quantization with physics-driven sparsity embedded in algorithm-unrolled solvers to compress networks dramatically while preserving essential operator geometry. It advances with two-stage quantization-aware training using a single global scale $\λ$, constructs sparse, block-diagonal weight matrices aligned with problem structure, and provides convergence guarantees for one-bit unrolling along with a local-Rademacher-based generalization bound. Empirically, PIBiNN achieves extreme compression (up to ≈$99.9\%$ reduction in parameters) with competitive accuracy on synthetic tasks and real datasets (BSD500, EEGdenoiseNet), and scales to large block-sparse configurations, significantly reducing memory and metadata overhead. The work offers a principled pathway to memory-efficient, physics-aware neural architectures, and highlights practical considerations for deployment and future system-level optimizations.
Abstract
Why rely on dense neural networks and then blindly sparsify them when prior knowledge about the problem structure is already available? Many inverse problems admit algorithm-unrolled networks that naturally encode physics and sparsity. In this work, we propose a Physics-Inspired Binary Neural Network (PIBiNN) that combines two key components: (i) data-driven one-bit quantization with a single global scale, and (ii) problem-driven sparsity predefined by physics and requiring no updates during training. This design yields compression rates below one bit per weight by exploiting structural zeros, while preserving essential operator geometry. Unlike ternary or pruning-based schemes, our approach avoids ad-hoc sparsification, reduces metadata overhead, and aligns directly with the underlying task. Experiments suggest that PIBiNN achieves advantages in both memory efficiency and generalization compared to competitive baselines such as ternary and channel-wise quantization.
