SCU: An Efficient Machine Unlearning Scheme for Deep Learning Enabled Semantic Communications
Weiqi Wang, Zhiyi Tian, Chenhan Zhang, Shui Yu
TL;DR
SCU tackles data erasure in DL-enabled semantic communications by enabling joint unlearning of the encoder and decoder while preserving semantic utility. It minimizes the mutual information between the learned semantic representation and erased data $I(Z;D_e)$ and employs a contrastive compensation step that treats erased samples as negatives and the remaining data as positives during retraining. The approach is supported by theoretical analysis and experiments on MNIST, CIFAR10, and CIFAR100, showing improved unlearning effectiveness and efficiency over baselines. This work advances privacy-preserving semantic communication by providing a practical, end-to-end unlearning framework for unsupervised or self-supervised semantic models.
Abstract
Deep learning (DL) enabled semantic communications leverage DL to train encoders and decoders (codecs) to extract and recover semantic information. However, most semantic training datasets contain personal private information. Such concerns call for enormous requirements for specified data erasure from semantic codecs when previous users hope to move their data from the semantic system. {Existing machine unlearning solutions remove data contribution from trained models, yet usually in supervised sole model scenarios. These methods are infeasible in semantic communications that often need to jointly train unsupervised encoders and decoders.} In this paper, we investigate the unlearning problem in DL-enabled semantic communications and propose a semantic communication unlearning (SCU) scheme to tackle the problem. {SCU includes two key components. Firstly,} we customize the joint unlearning method for semantic codecs, including the encoder and decoder, by minimizing mutual information between the learned semantic representation and the erased samples. {Secondly,} to compensate for semantic model utility degradation caused by unlearning, we propose a contrastive compensation method, which considers the erased data as the negative samples and the remaining data as the positive samples to retrain the unlearned semantic models contrastively. Theoretical analysis and extensive experimental results on three representative datasets demonstrate the effectiveness and efficiency of our proposed methods.
