Intrusion Tolerance for Networked Systems through Two-Level Feedback Control
Kim Hammar, Rolf Stadler
TL;DR
This work introduces TOLERANCE, a two-level feedback-control architecture for intrusion-tolerant systems that jointly optimizes local intrusion-recovery and global replication management. By mapping the two control problems to classical operations-research tasks—the machine replacement problem and the inventory replenishment problem—the authors prove that optimal policies have threshold structures and develop efficient algorithms to compute them. They implement a proof-of-concept on a three-layer testbed using a reconfigurable minBFT consensus and evaluate it against a suite of 10 intrusion types, showing substantial gains in service availability and reductions in operational cost over state-of-the-art intrusion-tolerant approaches. The work also provides a rigorous framework for analyzing recovery and replication under partial synchrony and Byzantine-style threats, and outlines directions for extending the model with game-theoretic analysis and online-learning of intrusion-detection models.
Abstract
We formulate intrusion tolerance for a system with service replicas as a two-level optimal control problem. On the local level node controllers perform intrusion recovery, and on the global level a system controller manages the replication factor. The local and global control problems can be formulated as classical problems in operations research, namely, the machine replacement problem and the inventory replenishment problem. Based on this formulation, we design TOLERANCE, a novel control architecture for intrusion-tolerant systems. We prove that the optimal control strategies on both levels have threshold structure and design efficient algorithms for computing them. We implement and evaluate TOLERANCE in an emulation environment where we run 10 types of network intrusions. The results show that TOLERANCE can improve service availability and reduce operational cost compared with state-of-the-art intrusion-tolerant systems.
