DP-DCAN: Differentially Private Deep Contrastive Autoencoder Network for Single-cell Clustering
Huifa Li, Jie Fu, Zhili Chen, Xiaomin Yang, Haitao Liu, Xinpeng Ling
TL;DR
DP-DCAN addresses privacy-preserving single-cell clustering by applying differential privacy to only the encoder gradient (partial network perturbation), leveraging a two-stage contrastive learning scheme to improve latent representations. The model uses a ZINB-based reconstruction loss and joint embedding-clustering objectives, achieving strong clustering performance under DP on eight real scRNA-seq datasets. A Rényi differential privacy analysis and empirical evaluations show DP-DCAN can outperform full-network perturbation baselines while requiring smaller privacy budgets, highlighting its practicality for privacy-sensitive genomics analyses. This approach offers a concrete, scalable path for deploying private deep learning models in single-cell genomics without sacrificing utility during clustering and downstream interpretation.
Abstract
Single-cell RNA sequencing (scRNA-seq) is important to transcriptomic analysis of gene expression. Recently, deep learning has facilitated the analysis of high-dimensional single-cell data. Unfortunately, deep learning models may leak sensitive information about users. As a result, Differential Privacy (DP) is increasingly used to protect privacy. However, existing DP methods usually perturb whole neural networks to achieve differential privacy, and hence result in great performance overheads. To address this challenge, in this paper, we take advantage of the uniqueness of the autoencoder that it outputs only the dimension-reduced vector in the middle of the network, and design a Differentially Private Deep Contrastive Autoencoder Network (DP-DCAN) by partial network perturbation for single-cell clustering. Since only partial network is added with noise, the performance improvement is obvious and twofold: one part of network is trained with less noise due to a bigger privacy budget, and the other part is trained without any noise. Experimental results of six datasets have verified that DP-DCAN is superior to the traditional DP scheme with whole network perturbation. Moreover, DP-DCAN demonstrates strong robustness to adversarial attacks.
