Multimodal Posterior Sampling-based Uncertainty in PD-L1 Segmentation from H&E Images
Roman Kinakh, Gonzalo R. Ríos-Muñoz, Arrate Muñoz-Barrutia
TL;DR
The paper addresses the challenge of PD-L1 biomarker assessment by inferring PD-L1-expressing regions directly from H&E histology, bypassing resource-intensive IHC. It introduces nnUNet-B, a Bayesian segmentation framework that employs Multimodal Posterior Sampling (MPS) to sample diverse checkpoints along a cyclic training trajectory of nnUNet-v2, approximating the posterior and yielding pixel-wise uncertainty maps. Inference uses an ensemble of N models to produce averaged segmentation probabilities and uncertainty measures via entropy $H$ and standard deviation $\sigma$, enabling uncertainty-aware predictions. On 1,088 paired H&E–IHC images from lung squamous cell carcinoma, nnUNet-B achieves competitive metrics ($mDice=0.805$, $mIoU=0.709$, $mHD95=97$, $mPA=0.860$) with correlated but not perfectly calibrated uncertainty, suggesting uncertainty-aware, scalable biomarker inference is feasible for clinical workflows.
Abstract
Accurate assessment of PD-L1 expression is critical for guiding immunotherapy, yet current immunohistochemistry (IHC) based methods are resource-intensive. We present nnUNet-B: a Bayesian segmentation framework that infers PD-L1 expression directly from H&E-stained histology images using Multimodal Posterior Sampling (MPS). Built upon nnUNet-v2, our method samples diverse model checkpoints during cyclic training to approximate the posterior, enabling both accurate segmentation and epistemic uncertainty estimation via entropy and standard deviation. Evaluated on a dataset of lung squamous cell carcinoma, our approach achieves competitive performance against established baselines with mean Dice Score and mean IoU of 0.805 and 0.709, respectively, while providing pixel-wise uncertainty maps. Uncertainty estimates show strong correlation with segmentation error, though calibration remains imperfect. These results suggest that uncertainty-aware H&E-based PD-L1 prediction is a promising step toward scalable, interpretable biomarker assessment in clinical workflows.
