Subset-Contrastive Multi-Omics Network Embedding
Pedro Henrique da Costa Avelar, Min Wu, Sophia Tsoka
TL;DR
SCONE tackles the memory and scalability challenges of graph-based multi-omics analyses by introducing subset-contrastive learning on two overlapping subset views. Each omic view is learned with GAT-based encoders on KNN graphs, and a shared latent space is formed through pooling and reconstruction, while subset-contrastive losses align overlapping samples and decay misalignment across non-overlapping ones. The approach yields competitive or superior clustering and survival-significance results across single-cell and bulk multi-omics datasets, while reducing memory demands relative to full-graph methods. Overall, SCONE demonstrates scalable, synergistic integration of heterogeneous omics layers with potential applicability to spatial transcriptomics and other large-scale multi-omics tasks.
Abstract
Motivation: Network-based analyses of omics data are widely used, and while many of these methods have been adapted to single-cell scenarios, they often remain memory- and space-intensive. As a result, they are better suited to batch data or smaller datasets. Furthermore, the application of network-based methods in multi-omics often relies on similarity-based networks, which lack structurally-discrete topologies. This limitation may reduce the effectiveness of graph-based methods that were initially designed for topologies with better defined structures. Results: We propose Subset-Contrastive multi-Omics Network Embedding (SCONE), a method that employs contrastive learning techniques on large datasets through a scalable subgraph contrastive approach. By exploiting the pairwise similarity basis of many network-based omics methods, we transformed this characteristic into a strength, developing an approach that aims to achieve scalable and effective analysis. Our method demonstrates synergistic omics integration for cell type clustering in single-cell data. Additionally, we evaluate its performance in a bulk multi-omics integration scenario, where SCONE performs comparable to the state-of-the-art despite utilising limited views of the original data. We anticipate that our findings will motivate further research into the use of subset contrastive methods for omics data.
