Cellular automata, many-valued logic, and deep neural networks
Yani Zhang, Helmut Bölcskei
TL;DR
The paper addresses the problem of uncovering the logical rules governing cellular automata and extracting them from neural network traces. It develops a theoretical bridge between CA and Łukasiewicz MV logic, leveraging the McNaughton theorem to show that CA transition functions correspond to MV term functions, which deep ReLU networks can realize. By extending to divisible MV algebras (DMV), it accommodates rational coefficients in piecewise-linear interpolations and proves a universal NN representation for CA transitions, further embedding these into RNNs to realize CA dynamics. It also provides an algorithm to read out DMV formulas from trained networks, enabling identification of CA logic from evolution data, with a software implementation and explicit examples including retrieval of MV terms like τ = x_{-1} ⊕ x_0 ⊕ x_1. This work advances interpretable rule extraction from data-generated CA dynamics and links symbolic logic with neural architectures for dynamical systems.
Abstract
We develop a theory characterizing the fundamental capability of deep neural networks to learn, from evolution traces, the logical rules governing the behavior of cellular automata (CA). This is accomplished by first establishing a novel connection between CA and Lukasiewicz propositional logic. While binary CA have been known for decades to essentially perform operations in Boolean logic, no such relationship exists for general CA. We demonstrate that many-valued (MV) logic, specifically Lukasiewicz propositional logic, constitutes a suitable language for characterizing general CA as logical machines. This is done by interpolating CA transition functions to continuous piecewise linear functions, which, by virtue of the McNaughton theorem, yield formulae in MV logic characterizing the CA. Recognizing that deep rectified linear unit (ReLU) networks realize continuous piecewise linear functions, it follows that these formulae are naturally extracted from CA evolution traces by deep ReLU networks. A corresponding algorithm together with a software implementation is provided. Finally, we show that the dynamical behavior of CA can be realized by recurrent neural networks.
