ImmuVis: Hyperconvolutional Foundation Model for Imaging Mass Cytometry
Marcin Możejko, Dawid Uchal, Krzysztof Gogolewski, Piotr Kupidura, Szymon Łukasik, Jakub Giezgała, Tomasz Nocoń, Kacper Pietrzyk, Robert Pieniuta, Mateusz Sulimowicz, Michal Orzyłowski, Tomasz Siłkowski, Karol Zagródka, Eike Staub, Ewa Szczurek
TL;DR
ImmuVis tackles the challenge of highly variable IMC marker panels by introducing marker-adaptive hyperconvolutions that condition convolutional kernels on learned marker embeddings, enabling a single foundation model to process arbitrary marker subsets without retraining. The model learns a pan-marker latent space and uses marker-conditioned hyperconvolutions in both encoder and decoder to produce per-marker reconstructions along with calibrated uncertainty via a Gaussian heteroscedastic objective. Pretrained on IMC17M, ImmuVis achieves state-of-the-art virtual staining, strong zero-shot performance across unseen panels, and competitive representations for cell typing and clinical prediction, with substantial efficiency advantages over transformer-based counterparts. The approach provides a practical, panel-flexible IMC model that delivers reliability-aware predictions and scalable deployment across cohorts, while outlining paths to extend to additional multiplex modalities and whole-slide clinical workflows.
Abstract
We present ImmuVis, an efficient convolutional foundation model for imaging mass cytometry (IMC), a high-throughput multiplex imaging technology that handles molecular marker measurements as image channels and enables large-scale spatial tissue profiling. Unlike natural images, multiplex imaging lacks a fixed channel space, as real-world marker sets vary across studies, violating a core assumption of standard vision backbones. To address this, ImmuVis introduces marker-adaptive hyperconvolutions that generate convolutional kernels from learned marker embeddings, enabling a single model to operate on arbitrary measured marker subsets without retraining. We pretrain ImmuVis on the largest to-date dataset, IMC17M (28 cohorts, 24,405 images, 265 markers, over 17M patches), using self-supervised masked reconstruction. ImmuVis outperforms SOTA baselines and ablations in virtual staining and downstream classification tasks at substantially lower compute cost than transformer-based alternatives, and is the sole model that provides calibrated uncertainty via a heteroscedastic likelihood objective. These results position ImmuVis as a practical, efficient foundation model for real-world IMC modeling.
