Table of Contents
Fetching ...

High-Efficiency Split Computing for Cooperative Edge Systems: A Novel Compressed Sensing Bottleneck

Hailin Zhong, Donglong Chen

TL;DR

HECS-B addresses the bandwidth and latency bottlenecks in edge–cloud split computing by introducing a High-Efficiency Compressed Sensing Bottleneck that embeds a compressed sensing autoencoder into the shallow DNN layer and leverages knowledge distillation for end-to-end training. It formulates a mutual-information objective $I_ppa(X,Y)$ with a variational lower bound $\mathbb{E}_{Q_ppa(X,Y)}[\log p_ heta(X|Y)]$, coupled with a trainable prior $p_ppa(z)$ to preserve informative features during compression. The architecture integrates a bottleneck through $q_ heta(z|x)$ and $p_ppa(h|z)$, optimized with KD-based losses and reparameterization to enable distributed inference across edge devices and the cloud. Empirical results on MNIST/Omniglot and ImageNet with a ResNet-50 backbone reveal about 50% bandwidth reduction and 60% faster computation while maintaining accuracy, demonstrating practical applicability for scalable edge–cloud AI.

Abstract

The advent of big data and AI has precipitated a demand for computational frameworks that ensure real-time performance, accuracy, and privacy. While edge computing mitigates latency and privacy concerns, its scalability is constrained by the resources of edge devices, thus prompting the adoption of split computing (SC) addresses these limitations. However, SC faces challenges in (1) efficient data transmission under bandwidth constraints and (2) balancing accuracy with real-time performance. To tackle these challenges, we propose a novel split computing architecture inspired by compressed sensing (CS) theory. At its core is the High-Efficiency Compressed Sensing Bottleneck (HECS-B), which incorporates an efficient compressed sensing autoencoder into the shallow layer of a deep neural network (DNN) to create a bottleneck layer using the knowledge distillation method. This bottleneck splits the DNN into a distributed model while efficiently compressing intermediate feature data, preserving critical information for seamless reconstruction in the cloud. Through rigorous theoretical analysis and extensive experimental validation in both simulated and real-world settings, we demonstrate the effectiveness of the proposed approach. Compared to state-of-the-art methods, our architecture reduces bandwidth utilization by 50%, maintains high accuracy, and achieves a 60% speed-up in computational efficiency. The results highlight significant improvements in bandwidth efficiency, processing speed, and model accuracy, underscoring the potential of HECS-B to bridge the gap between resource-constrained edge devices and computationally intensive cloud services.

High-Efficiency Split Computing for Cooperative Edge Systems: A Novel Compressed Sensing Bottleneck

TL;DR

HECS-B addresses the bandwidth and latency bottlenecks in edge–cloud split computing by introducing a High-Efficiency Compressed Sensing Bottleneck that embeds a compressed sensing autoencoder into the shallow DNN layer and leverages knowledge distillation for end-to-end training. It formulates a mutual-information objective with a variational lower bound , coupled with a trainable prior to preserve informative features during compression. The architecture integrates a bottleneck through and , optimized with KD-based losses and reparameterization to enable distributed inference across edge devices and the cloud. Empirical results on MNIST/Omniglot and ImageNet with a ResNet-50 backbone reveal about 50% bandwidth reduction and 60% faster computation while maintaining accuracy, demonstrating practical applicability for scalable edge–cloud AI.

Abstract

The advent of big data and AI has precipitated a demand for computational frameworks that ensure real-time performance, accuracy, and privacy. While edge computing mitigates latency and privacy concerns, its scalability is constrained by the resources of edge devices, thus prompting the adoption of split computing (SC) addresses these limitations. However, SC faces challenges in (1) efficient data transmission under bandwidth constraints and (2) balancing accuracy with real-time performance. To tackle these challenges, we propose a novel split computing architecture inspired by compressed sensing (CS) theory. At its core is the High-Efficiency Compressed Sensing Bottleneck (HECS-B), which incorporates an efficient compressed sensing autoencoder into the shallow layer of a deep neural network (DNN) to create a bottleneck layer using the knowledge distillation method. This bottleneck splits the DNN into a distributed model while efficiently compressing intermediate feature data, preserving critical information for seamless reconstruction in the cloud. Through rigorous theoretical analysis and extensive experimental validation in both simulated and real-world settings, we demonstrate the effectiveness of the proposed approach. Compared to state-of-the-art methods, our architecture reduces bandwidth utilization by 50%, maintains high accuracy, and achieves a 60% speed-up in computational efficiency. The results highlight significant improvements in bandwidth efficiency, processing speed, and model accuracy, underscoring the potential of HECS-B to bridge the gap between resource-constrained edge devices and computationally intensive cloud services.

Paper Structure

This paper contains 33 sections, 16 equations, 8 figures, 2 tables.

Figures (8)

  • Figure 1: Edge computing structure.
  • Figure 2: Split computing structure
  • Figure 3: Split computing without bottleneck injection
  • Figure 4: Split computing with bottleneck injection
  • Figure 5: Original(teacher) model and target(student) model structure of KD
  • ...and 3 more figures