ReLU Neural Networks of Polynomial Size for Exact Maximum Flow Computation
Christoph Hertrich, Leon Sering
TL;DR
The paper investigates the exact expressivity of ReLU neural networks as a real-valued computation model by introducing Max-Affine Arithmetic Programs (MAAPs) and proving their equivalence to NNs in depth, width, and size. It then uses this framework to construct polynomial-size networks that solve two classic combinatorial optimization problems: the minimum spanning tree value for undirected graphs with $n$ nodes via a network of size $O(n^3)$ and depth $O(n\log n)$, and the maximum $s$-$t$ flow for directed graphs with $n$ nodes and $m$ arcs via a network of size $O(m^2n^2)$ and width $O(1)$. These results show that such problems admit strongly polynomial-time algorithms built from affine transformations and maxima, without branching. The authors also discuss implications for learning theory, Boolean and tropical circuit models, and parallel computation, and they outline future research directions, including potential lower bounds and extensions to additional CO problems. Overall, the work advances the understanding of how polynomial-size ReLU networks can represent exact solutions to key optimization problems and how MAAPs can serve as a practical bridge between algorithm design and neural-network constructions.
Abstract
This paper studies the expressive power of artificial neural networks with rectified linear units. In order to study them as a model of real-valued computation, we introduce the concept of Max-Affine Arithmetic Programs and show equivalence between them and neural networks concerning natural complexity measures. We then use this result to show that two fundamental combinatorial optimization problems can be solved with polynomial-size neural networks. First, we show that for any undirected graph with $n$ nodes, there is a neural network (with fixed weights and biases) of size $\mathcal{O}(n^3)$ that takes the edge weights as input and computes the value of a minimum spanning tree of the graph. Second, we show that for any directed graph with $n$ nodes and $m$ arcs, there is a neural network of size $\mathcal{O}(m^2n^2)$ that takes the arc capacities as input and computes a maximum flow. Our results imply that these two problems can be solved with strongly polynomial time algorithms that solely use affine transformations and maxima computations, but no comparison-based branchings.
