Table of Contents
Fetching ...

Extracting Robust Register Automata from Neural Networks over Data Sequences

Chih-Duo Hong, Hongjian Jiang, Anthony W. Lin, Oliver Markgraf, Julian Parsert, Tony Tan

TL;DR

This work develops a framework for extracting robust deterministic register automata (DRAs) from neural networks that operate on data sequences, enabling interpretable, symbolically verifiable surrogates. It introduces two-head RAAs with an accumulator to model perturbation costs and proves that, for a fixed number of registers, robustness of DRA languages can be decided in polynomial time via reductions to coverability and shortest-path problems. The authors provide three learning paradigms (SMT-based, local-search, and active learning) plus a robustness-aware extraction loop that yields DRAs with statistical guarantees, verified through PAC-style equivalence checks and local δ-stability testing. Extensive experiments on RNNs and transformers across 18 languages demonstrate that the approach learns accurate surrogate DRAs and can certify or refute network robustness in a principled, distribution-aware manner. Overall, the framework bridges neural network interpretability and formal reasoning for sequential data, with practical impact on robustness analysis in time-series and related domains.

Abstract

Automata extraction is a method for synthesising interpretable surrogates for black-box neural models that can be analysed symbolically. Existing techniques assume a finite input alphabet, and thus are not directly applicable to data sequences drawn from continuous domains. We address this challenge with deterministic register automata (DRAs), which extend finite automata with registers that store and compare numeric values. Our main contribution is a framework for robust DRA extraction from black-box models: we develop a polynomial-time robustness checker for DRAs with a fixed number of registers, and combine it with passive and active automata learning algorithms. This combination yields surrogate DRAs with statistical robustness and equivalence guarantees. As a key application, we use the extracted automata to assess the robustness of neural networks: for a given sequence and distance metric, the DRA either certifies local robustness or produces a concrete counterexample. Experiments on recurrent neural networks and transformer architectures show that our framework reliably learns accurate automata and enables principled robustness evaluation. Overall, our results demonstrate that robust DRA extraction effectively bridges neural network interpretability and formal reasoning without requiring white-box access to the underlying network.

Extracting Robust Register Automata from Neural Networks over Data Sequences

TL;DR

This work develops a framework for extracting robust deterministic register automata (DRAs) from neural networks that operate on data sequences, enabling interpretable, symbolically verifiable surrogates. It introduces two-head RAAs with an accumulator to model perturbation costs and proves that, for a fixed number of registers, robustness of DRA languages can be decided in polynomial time via reductions to coverability and shortest-path problems. The authors provide three learning paradigms (SMT-based, local-search, and active learning) plus a robustness-aware extraction loop that yields DRAs with statistical guarantees, verified through PAC-style equivalence checks and local δ-stability testing. Extensive experiments on RNNs and transformers across 18 languages demonstrate that the approach learns accurate surrogate DRAs and can certify or refute network robustness in a principled, distribution-aware manner. Overall, the framework bridges neural network interpretability and formal reasoning for sequential data, with practical impact on robustness analysis in time-series and related domains.

Abstract

Automata extraction is a method for synthesising interpretable surrogates for black-box neural models that can be analysed symbolically. Existing techniques assume a finite input alphabet, and thus are not directly applicable to data sequences drawn from continuous domains. We address this challenge with deterministic register automata (DRAs), which extend finite automata with registers that store and compare numeric values. Our main contribution is a framework for robust DRA extraction from black-box models: we develop a polynomial-time robustness checker for DRAs with a fixed number of registers, and combine it with passive and active automata learning algorithms. This combination yields surrogate DRAs with statistical robustness and equivalence guarantees. As a key application, we use the extracted automata to assess the robustness of neural networks: for a given sequence and distance metric, the DRA either certifies local robustness or produces a concrete counterexample. Experiments on recurrent neural networks and transformer architectures show that our framework reliably learns accurate automata and enables principled robustness evaluation. Overall, our results demonstrate that robust DRA extraction effectively bridges neural network interpretability and formal reasoning without requiring white-box access to the underlying network.

Paper Structure

This paper contains 46 sections, 10 theorems, 36 equations, 4 figures, 5 tables, 2 algorithms.

Key Result

Theorem 1

For any fixed number of registers $k$, checking whether a $k$-DRA is robust at a given sequence is decidable in polynomial time. Moreover, whenever robustness fails, a concrete witness to non-robustness can also be constructed in polynomial time.

Figures (4)

  • Figure 1: Two S&P 500 price sequences. Red and blue triangles mark lows and highs, respectively. The left uptrend is robust to small last-value decreases, while the right uptrend is fragile.
  • Figure 2: Workflow of our robust DRA extraction pipeline: $\delta > 0$ is the perturbation radius for the robustness check; $w \in \mathbb Q^*$ is the target sequence at which robustness of the DRA $\mathcal{A}$ is verified; $B_\delta(w)$ denotes the $\delta$-neighbourhood of $w$.
  • Figure 3: A deterministic register automaton $\mathcal{A}$ that identifies the higher-highs-higher-lows price pattern.
  • Figure 4: Illustration of sample generation using Markov chain.

Theorems & Definitions (16)

  • Example 1
  • Definition 1: Robustness
  • Theorem 1
  • Definition 2: Coverability
  • Lemma 2
  • Lemma 3
  • Lemma 4
  • Lemma 5
  • Claim 1
  • Claim 2
  • ...and 6 more