Table of Contents
Fetching ...

Compression and Acceleration of Neural Networks for Communications

Jiajia Guo, Jinghe Wang, Chao-Kai Wen, Shi Jin, Geoffrey Ye Li

TL;DR

This article investigates how to compress and accelerate the neural networks (NNs) in communication systems and identifies some challenges on NN compression and acceleration in DL-based communications.

Abstract

Deep learning (DL) has achieved great success in signal processing and communications and has become a promising technology for future wireless communications. Existing works mainly focus on exploiting DL to improve the performance of communication systems. However, the high memory requirement and computational complexity constitute a major hurdle for the practical deployment of DL-based communications. In this article, we investigate how to compress and accelerate the neural networks (NNs) in communication systems. After introducing the deployment challenges for DL-based communication algorithms, we discuss some representative NN compression and acceleration techniques. Afterwards, two case studies for multiple-input-multiple-output (MIMO) communications, including DL-based channel state information feedback and signal detection, are presented to show the feasibility and potential of these techniques. We finally identify some challenges on NN compression and acceleration in DL-based communications and provide a guideline for subsequent research.

Compression and Acceleration of Neural Networks for Communications

TL;DR

This article investigates how to compress and accelerate the neural networks (NNs) in communication systems and identifies some challenges on NN compression and acceleration in DL-based communications.

Abstract

Deep learning (DL) has achieved great success in signal processing and communications and has become a promising technology for future wireless communications. Existing works mainly focus on exploiting DL to improve the performance of communication systems. However, the high memory requirement and computational complexity constitute a major hurdle for the practical deployment of DL-based communications. In this article, we investigate how to compress and accelerate the neural networks (NNs) in communication systems. After introducing the deployment challenges for DL-based communication algorithms, we discuss some representative NN compression and acceleration techniques. Afterwards, two case studies for multiple-input-multiple-output (MIMO) communications, including DL-based channel state information feedback and signal detection, are presented to show the feasibility and potential of these techniques. We finally identify some challenges on NN compression and acceleration in DL-based communications and provide a guideline for subsequent research.

Paper Structure

This paper contains 13 sections, 4 figures, 2 tables.

Figures (4)

  • Figure 1: Overview of the training and implementation strategy of DL-based communication algorithms.
  • Figure 2: Fire module architecture: a squeeze layer and a branching layer with $1\times1$ and $3\times3$ filtersiandola2016squeezenet.
  • Figure 3: NMSE performance and weight number comparison between ConvCsiNet and ConvSquCsiNet.
  • Figure 4: BER performance comparison between FullyCon and the pruned or quantized one.