HEMIT: H&E to Multiplex-immunohistochemistry Image Translation with Dual-Branch Pix2pix Generator
Chang Bian, Beth Philips, Tim Cootes, Martin Fergie
TL;DR
HEMIT tackles the challenge of translating traditional H&E histology to multiplex mIHC images by introducing a publicly available cellularly aligned dataset and a dual-branch Pix2Pix generator that fuses CNN-based spatial detail with Swin Transformer–driven multi-scale context. The dataset pairs H&E and multi-marker mIHC (DAPI, CD3, panCK) from the same tissue section, enabling precise cell-to-cell translation and robust supervision. The proposed architecture, including a Feature Map Fusion module and cross-attention between branches, achieves superior image translation performance on SSIM, Pearson correlation, and PSNR, and is validated through downstream cell-level analyses using QuPath and Stardist. This work provides a valuable resource and benchmark for stain translation research and paves the way for biomarker inference from conventional H&E slides in oncology research.
Abstract
Computational analysis of multiplexed immunofluorescence histology data is emerging as an important method for understanding the tumour micro-environment in cancer. This work presents HEMIT, a dataset designed for translating Hematoxylin and Eosin (H&E) sections to multiplex-immunohistochemistry (mIHC) images, featuring DAPI, CD3, and panCK markers. Distinctively, HEMIT's mIHC images are multi-component and cellular-level aligned with H&E, enriching supervised stain translation tasks. To our knowledge, HEMIT is the first publicly available cellular-level aligned dataset that enables H&E to multi-target mIHC image translation. This dataset provides the computer vision community with a valuable resource to develop novel computational methods which have the potential to gain new insights from H&E slide archives. We also propose a new dual-branch generator architecture, using residual Convolutional Neural Networks (CNNs) and Swin Transformers which achieves better translation outcomes than other popular algorithms. When evaluated on HEMIT, it outperforms pix2pixHD, pix2pix, U-Net, and ResNet, achieving the highest overall score on key metrics including the Structural Similarity Index Measure (SSIM), Pearson correlation score (R), and Peak signal-to-noise Ratio (PSNR). Additionally, downstream analysis has been used to further validate the quality of the generated mIHC images. These results set a new benchmark in the field of stain translation tasks.
