Towards generalization of drug response prediction to single cells and patients utilizing importance-aware multi-source domain transfer learning
Hui Liu, Wei Duan, Judong Luo
TL;DR
This work tackles single-cell drug response prediction by transferring knowledge from bulk-cell-line drug sensitivity data to single-cell and patient contexts through scAdaDrug, a multi-source domain adaptation framework. It introduces an adaptive importance-aware weight generator that assigns sample- and feature-level transfer weights, along with a conditional independence constraint to reduce redundancy, and couples this with adversarial domain alignment and supervised prediction. Across multiple datasets including cell lines, PDXs, and TCGA patients, the approach achieves state-of-the-art performance and demonstrates that increasing the number of source domains and using gene- and pathway-informed inputs enhance generalization. The study highlights the potential of bulk-to-single-cell transfer learning for precision oncology while acknowledging domain gaps and data sparsity challenges, and points toward domain generalization as a future direction. Overall, scAdaDrug provides a robust framework for integrating diverse data sources to predict drug response at the single-cell and patient levels, with practical implications for identifying resistant subpopulations and informing therapy choices.
Abstract
The advancement of single-cell sequencing technology has promoted the generation of a large amount of single-cell transcriptional profiles, providing unprecedented opportunities to identify drug-resistant cell subpopulations within a tumor. However, few studies have focused on drug response prediction at single-cell level, and their performance remains suboptimal. This paper proposed scAdaDrug, a novel multi-source domain adaptation model powered by adaptive importance-aware representation learning to predict drug response of individual cells. We used a shared encoder to extract domain-invariant features related to drug response from multiple source domains by utilizing adversarial domain adaptation. Particularly, we introduced a plug-and-play module to generate importance-aware and mutually independent weights, which could adaptively modulate the latent representation of each sample in element-wise manner between source and target domains. Extensive experimental results showed that our model achieved state-of-the-art performance in predicting drug response on multiple independent datasets, including single-cell datasets derived from both cell lines and patient-derived xenografts (PDX) models, as well as clinical tumor patient cohorts. Moreover, the ablation experiments demonstrated our model effectively captured the underlying patterns determining drug response from multiple source domains.
