A Bayesian Approach to Weakly-supervised Laparoscopic Image Segmentation
Zhou Zheng, Yuichiro Hayashi, Masahiro Oda, Takayuki Kitasaka, Kensaku Mori
TL;DR
This work tackles weakly‑supervised laparoscopic image segmentation under sparse annotations by introducing a fully Bayesian framework that models the joint distribution $p(oldsymbol{x},oldsymbol{y})$ via latent variables, enabling sampling of high‑quality pseudo‑labels and explicit uncertainty estimation. The methodology combines a conditional variational auto‑encoder with a DenseCRF‑based CRF term to maximize an ELBO objective, while using two encoder–decoder streams to reconstruct images and generate segmentation maps from latent codes and image features. Empirical results on CholecSeg8k and AutoLaparo show state‑of‑the‑art performance among scribble/weakly‑supervised methods, with further demonstration on scribble‑supervised cardiac segmentation (ACDC) indicating cross‑domain generalizability. The approach provides uncertainty quantification via MC dropout and improves robustness to sparse supervision, at the cost of higher computational demand. The work contributes a principled Bayesian formulation, extensive validation, and public code for broader adoption in label‑efficient medical image segmentation.
Abstract
In this paper, we study weakly-supervised laparoscopic image segmentation with sparse annotations. We introduce a novel Bayesian deep learning approach designed to enhance both the accuracy and interpretability of the model's segmentation, founded upon a comprehensive Bayesian framework, ensuring a robust and theoretically validated method. Our approach diverges from conventional methods that directly train using observed images and their corresponding weak annotations. Instead, we estimate the joint distribution of both images and labels given the acquired data. This facilitates the sampling of images and their high-quality pseudo-labels, enabling the training of a generalizable segmentation model. Each component of our model is expressed through probabilistic formulations, providing a coherent and interpretable structure. This probabilistic nature benefits accurate and practical learning from sparse annotations and equips our model with the ability to quantify uncertainty. Extensive evaluations with two public laparoscopic datasets demonstrated the efficacy of our method, which consistently outperformed existing methods. Furthermore, our method was adapted for scribble-supervised cardiac multi-structure segmentation, presenting competitive performance compared to previous methods. The code is available at https://github.com/MoriLabNU/Bayesian_WSS.
