Adversarial Attacks to Latent Representations of Distributed Neural Networks in Split Computing
Milin Zhang, Mohammad Abdi, Jonathan Ashdown, Francesco Restuccia
TL;DR
This work investigates the adversarial robustness of distributed DNNs under split computing by analyzing latent representations through an information-theoretic lens. Using the Information Bottleneck, it shows that robustness improves with deeper splitting points but may worsen, owing to bias, if the latent dimension is too small, revealing a fundamental depth-dimension trade-off. The authors validate these insights through extensive experiments across six architectures, six distributed approaches, and ten attacks on ImageNet-1K, plus additional tasks, demonstrating that latent-space perturbations are consistently less effective than input-space ones, especially with bottlenecks and deeper splitting points. The findings provide design guidance for robust, latency-aware distributed inference and open avenues for jointly optimizing accuracy, bandwidth, and adversarial resilience in edge-cloud systems.
Abstract
Distributed deep neural networks (DNNs) have been shown to reduce the computational burden of mobile devices and decrease the end-to-end inference latency in edge computing scenarios. While distributed DNNs have been studied, to the best of our knowledge, their resilience to adversarial action remains an open problem. In this paper, we fill this research gap by rigorously analyzing the robustness of distributed DNNs against adversarial action. We cast the problem in the context of information theory and prove that (i) a compressed latent dimension improves robustness but also affects task-oriented performance; and (ii) a deeper splitting point enhances robustness but also increases the computational burden. These two trade-offs provide a novel perspective for designing robust distributed DNNs. To test our theoretical findings, we perform extensive experimental analysis considering 6 different DNN architectures, 6 different approaches to distributed DNNs, and 10 different adversarial attacks using the ImageNet-1K dataset.
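To make the setting concrete, the following is a minimal sketch of split computing and of the two places a perturbation can enter: the input on the mobile device, or the latent representation sent to the edge. It is a toy illustration under stated assumptions, not the models or attacks evaluated in the paper: the layer sizes, the random weights, the helper names (`head`, `tail`, `perturb`), and the random-sign perturbation are all hypothetical, and the sketch makes no claim about which perturbation is more harmful.

```python
import random

def linear(x, weights, bias):
    """Dense layer: one output per row of `weights`."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]

def relu(x):
    return [max(0.0, v) for v in x]

def make_layer(n_out, n_in, rng):
    """Random weights and zero bias (stand-in for a trained layer)."""
    weights = [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out

rng = random.Random(0)

# Hypothetical sizes: the latent is a bottleneck (smaller than the input).
INPUT_DIM, LATENT_DIM, NUM_CLASSES = 8, 3, 4

head_params = make_layer(LATENT_DIM, INPUT_DIM, rng)    # runs on the mobile device
tail_params = make_layer(NUM_CLASSES, LATENT_DIM, rng)  # runs on the edge server

def head(x):
    """Device-side part of the DNN, up to the splitting point."""
    return relu(linear(x, *head_params))

def tail(z):
    """Edge-side part of the DNN, from the latent to the logits."""
    return linear(z, *tail_params)

def perturb(v, eps, rng):
    """Random-sign perturbation with L-infinity norm eps.
    Assumption: a placeholder, not an optimized adversarial attack."""
    return [vi + eps * rng.choice((-1.0, 1.0)) for vi in v]

eps = 0.1
x = [rng.uniform(0, 1) for _ in range(INPUT_DIM)]
z = head(x)                      # latent representation sent over the network

x_adv = perturb(x, eps, rng)     # input-space perturbation
z_adv = perturb(z, eps, rng)     # latent-space perturbation

logits_clean = tail(z)
logits_input_attack = tail(head(x_adv))
logits_latent_attack = tail(z_adv)
```

Moving the splitting point deeper corresponds to placing more layers in `head`, and shrinking `LATENT_DIM` corresponds to a tighter bottleneck; these are the two design knobs the abstract's trade-offs refer to.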
