Gene-Level Representation Learning via Interventional Style Transfer in Optical Pooled Screening
Mahtab Bigverdi, Burkhard Hockendorf, Heming Yao, Phil Hanslovsky, Romain Lopez, David Richmond
TL;DR
GRAPE presents a GAN-based framework that learns gene perturbation representations from Optical Pooled Screening images by interventional style transfer, using a trainable gene embedding $M \in \mathbb{R}^{107 \times 500}$ and a mapping network to generate perturbation-specific style codes. Through adversarial, cycle-consistency, and style reconstruction losses, GRAPE disentangles perturbation effects from nuisance content, enabling gene embeddings that cluster by CORUM-defined complexes and outperform engineered features and several baselines in clustering metrics, with competitive mAP. The approach demonstrates the viability of style-transfer-inspired representation learning for high-throughput gene function discovery, while noting limitations in perturbation breadth and the potential gains from expanding the perturbation set. Overall, GRAPE offers a scalable pathway to uncover latent gene relationships from OPS data, with practical implications for health and disease research.
Abstract
Optical pooled screening (OPS) combines automated microscopy and genetic perturbations to systematically study gene function in a scalable and cost-effective way. Leveraging the resulting data requires extracting biologically informative representations of cellular perturbation phenotypes from images. We employ a style-transfer approach to learn gene-level feature representations from images of genetically perturbed cells obtained via OPS. Our method outperforms widely used engineered features in clustering gene representations according to gene function, demonstrating its utility for uncovering latent biological relationships. This approach offers a promising alternative to investigate the role of genes in health and disease.
