Minimize Control Inputs for Strong Structural Controllability Using Reinforcement Learning with Graph Neural Network
Mengbang Zou, Weisi Guo, Bailu Jin
TL;DR
This work tackles the problem of minimizing control inputs to guarantee strong structural controllability (SSC) in directed networks under zero/nonzero/arbitrary structure. It reframes SSC as a graph coloring task guided by a color-change rule and solves the minimum-input problem via a reinforcement learning framework that employs a directed graph neural network in an actor-critic setup. The approach yields a general framework applicable to multiple structure types and reveals that minimal input sets tend to include nodes with low in-degree, with the required number of inputs scaling with the network's average degree. Empirical results on a social-influence network and Erdos–Renyi graphs show the RL method consistently outperforms degree-based heuristics, offering a scalable strategy for input placement in complex networks.
Abstract
Strong structural controllability (SSC) guarantees networked system with linear-invariant dynamics controllable for all numerical realizations of parameters. Current research has established algebraic and graph-theoretic conditions of SSC for zero/nonzero or zero/nonzero/arbitrary structure. One relevant practical problem is how to fully control the system with the minimal number of input signals and identify which nodes must be imposed signals. Previous work shows that this optimization problem is NP-hard and it is difficult to find the solution. To solve this problem, we formulate the graph coloring process as a Markov decision process (MDP) according to the graph-theoretical condition of SSC for both zero/nonzero and zero/nonzero/arbitrary structure. We use Actor-critic method with Directed graph neural network which represents the color information of graph to optimize MDP. Our method is validated in a social influence network with real data and different complex network models. We find that the number of input nodes is determined by the average degree of the network and the input nodes tend to select nodes with low in-degree and avoid high-degree nodes.
