StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images
Robert Jewsbury, Ruoyu Wang, Abhir Bhalerao, Nasir Rajpoot, Quoc Dang Vu
TL;DR
StainFuser tackles stain normalization under domain shift in histology by reframing the task as neural style transfer using a conditional latent diffusion model. It introduces SPI-2M, a dataset with over 2 million NST-generated image pairs, to train a diffusion-based StainFuser that preserves morphology while applying target stain characteristics. Across CoNIC and Atypia-14 assessments, StainFuser outperforms handcrafted, GAN-based, and prior diffusion-based methods in image quality and downstream nuclei segmentation/classification, while delivering roughly 30× faster inference than NST. The work demonstrates strong consistency on whole-slide images and highlights practical considerations, including resolution choice, denoising steps, and data volume, for deploying diffusion-based stain normalization in clinical pathology workflows.
Abstract
Stain normalization algorithms aim to transform the color and intensity characteristics of a source multi-gigapixel histology image to match those of a target image, mitigating inconsistencies in the appearance of stains used to highlight cellular components in the images. We propose a new approach, StainFuser, which treats this problem as a style transfer task using a novel Conditional Latent Diffusion architecture, eliminating the need for handcrafted color components. With this method, we curate SPI-2M the largest stain normalization dataset to date of over 2 million histology images with neural style transfer for high-quality transformations. Trained on this data, StainFuser outperforms current state-of-the-art deep learning and handcrafted methods in terms of the quality of normalized images and in terms of downstream model performance on the CoNIC dataset.
