Causal integration of chemical structures improves representations of microscopy images for morphological profiling
Yemin Yu, Neil Tenenholtz, Lester Mackey, Ying Wei, David Alvarez-Melis, Ava P. Amini, Alex X. Lu
TL;DR
This work tackles the multimodal nature of high-throughput morphological profiling by proposing MICON, a representation learning framework that integrates chemical structure information during self-supervised pre-training. MICON models compounds as treatments that generate counterfactual transformations of microscopy images and optimizes a dual-contrastive objective: a Perturbation-Aware Contrastive Learning (PaCLR) term for robust image representations and a counterfactual PaCLR term that aligns synthetic counterfactuals with real perturbation representations. Across multi-source datasets and multiple evaluation regimes, MICON outperforms hand-crafted features and existing self-supervised baselines, including in settings with unseen sources and unseen compounds, and benefits further from batch-effect correction. The findings argue for explicitly modeling perturbations in morphological profiling and demonstrate the practical potential of generating and leveraging counterfactual representations to improve robustness and generalization in multimodal cellular phenotyping.
Abstract
Recent advances in self-supervised deep learning have improved our ability to quantify cellular morphological changes in high-throughput microscopy screens, a process known as morphological profiling. However, most current methods only learn from images, despite many screens being inherently multimodal, as they involve both a chemical or genetic perturbation as well as an image-based readout. We hypothesized that incorporating chemical compound structure during self-supervised pre-training could improve learned representations of images in high-throughput microscopy screens. We introduce a representation learning framework, MICON (Molecular-Image Contrastive Learning), that models chemical compounds as treatments that induce counterfactual transformations of cell phenotypes. MICON significantly outperforms classical hand-crafted features such as CellProfiler and existing deep-learning-based representation learning methods in challenging evaluation settings where models must identify reproducible effects of drugs across independent replicates and data-generating centers. We demonstrate that incorporating chemical compound information into the learning process provides consistent improvements in our evaluation setting and that modeling compounds specifically as treatments in a causal framework outperforms approaches that directly align images and compounds in a single representation space. Our findings point to a new direction for representation learning in morphological profiling, suggesting that methods should explicitly account for the multimodal nature of microscopy screening data.
