Pathology Image Restoration via Mixture of Prompts
Jiangdong Cai, Yan Chen, Zhenrong Shen, Haotian Jiang, Honglin Xiong, Kai Xuan, Lichi Zhang, Qian Wang
TL;DR
This work tackles the challenge of producing high-quality all-in-focus pathology images from a single focal plane by introducing a two-stage restoration pipeline guided by a novel mixture of prompts. A coarse-stage P-former uses pathology-oriented prompts with an MoE-based gating to restore semantic content, followed by a fine-stage P-diffusion that refines high-frequency details under semantic and structural constraints from $P_P$ and $P_E$. The prompts are generated from a defocus-aware framework combining a defocus encoder and pathology foundation models, enabling robust priors even on degraded inputs. Across in-house and 3DHistech datasets, the approach achieves strong distortion and perceptual metrics and improves downstream diagnostic performance, illustrating the practical impact of integrating domain-specific priors into a two-stage restoration framework. A remaining challenge is diffusion-time, which motivates future work to accelerate inference without compromising fidelity.
Abstract
In digital pathology, acquiring all-in-focus images is essential to high-quality imaging and high-efficient clinical workflow. Traditional scanners achieve this by scanning at multiple focal planes of varying depths and then merging them, which is relatively slow and often struggles with complex tissue defocus. Recent prevailing image restoration technique provides a means to restore high-quality pathology images from scans of single focal planes. However, existing image restoration methods are inadequate, due to intricate defocus patterns in pathology images and their domain-specific semantic complexities. In this work, we devise a two-stage restoration solution cascading a transformer and a diffusion model, to benefit from their powers in preserving image fidelity and perceptual quality, respectively. We particularly propose a novel mixture of prompts for the two-stage solution. Given initial prompt that models defocus in microscopic imaging, we design two prompts that describe the high-level image semantics from pathology foundation model and the fine-grained tissue structures via edge extraction. We demonstrate that, by feeding the prompt mixture to our method, we can restore high-quality pathology images from single-focal-plane scans, implying high potentials of the mixture of prompts to clinical usage. Code will be publicly available at https://github.com/caijd2000/MoP.
