Passive Learning of Lattice Automata from Recurrent Neural Networks
Jaouhar Slimi, Tristan Le Gall, Augustin Lemesle
TL;DR
Passive learning of Lattice Automata from Recurrent Neural Networks tackles the challenge of extracting interpretable automata from RNNs operating over very large or infinite alphabets. The authors integrate Abstract Interpretation with Grammatical Inference to produce Interval Lattice Automata that overapproximate RNN behavior using the Box abstraction on real-valued inputs. The method builds an Interval Prefix Tree Automaton from execution traces and merges states by a similarity score to yield a finite ILA, enabling automata over infinite alphabets. Experiments on Tomita languages and an extended Tomita-2.0 benchmark show fidelity comparable to state-of-the-art while enabling infinite-alphabet automata, with efficient inference and potential for verification and interpretability in time-series and other real-world tasks.
Abstract
We present a passive automata learning algorithm that can extract automata from recurrent networks with very large or even infinite alphabets. Our method combines overapproximations from the field of Abstract Interpretation and passive automata learning from the field of Grammatical Inference. We evaluate our algorithm by first comparing it with the state-of-the-art automata extraction algorithm from Recurrent Neural Networks trained on Tomita grammars. Then, we extend these experiments to regular languages with infinite alphabets, which we propose as a novel benchmark.
