Kernel-Based Testing for Single-Cell Differential Analysis
Anthony Ozier-Lafontaine, Camille Fourneaux, Ghislain Durif, Polina Arsenteva, Céline Vallot, Olivier Gandrillon, Sandrine Giraud, Bertrand Michel, Franck Picard
TL;DR
Kernel-based testing addresses the challenge of differential analysis in single-cell data by comparing full cell-wise distributions rather than univariate means. The authors develop two complementary tests based on weighted kernel-mean embeddings and Kernel Fisher Discriminant Analysis, with bandwidth selection and a zero-inflated extension for scRNA-Seq. They validate the approach on simulations and multiple public scRNA-Seq datasets, showing calibrated type-I error, strong power to detect non-linear alternatives, and competitive performance with existing methods; they also apply it to scChIP-Seq to reveal epigenomic heterogeneity and persister-like subpopulations. The results demonstrate the method's ability to uncover subtle population heterogeneities and to identify candidate subpopulations and epigenomic states, offering a flexible framework for multi-group designs in single-cell analysis.
Abstract
Single-cell technologies offer insights into molecular feature distributions, but comparing them poses challenges. We propose a kernel-testing framework for non-linear cell-wise distribution comparison, analyzing gene expression and epigenomic modifications. Our method allows feature-wise and global transcriptome/epigenome comparisons, revealing cell population heterogeneities. Using a classifier based on embedding variability, we identify transitions in cell states, overcoming limitations of traditional single-cell analysis. Applied to single-cell ChIP-Seq data, our approach identifies untreated breast cancer cells with an epigenomic profile resembling persister cells. This demonstrates the effectiveness of kernel testing in uncovering subtle population variations that might be missed by other methods.
