3D Medical Imaging Segmentation on Non-Contrast CT
Canxuan Gang, Yuhan Peng
TL;DR
Non-contrast CT (NCCT) segmentation in 3D medical imaging is challenged by limited ground-truth data and the need for accurate volumetric labeling. The paper surveys CNN-based and Transformer-based segmentation methods, analyzes their strengths and weaknesses, and advocates a fully convolutional-free Transformer encoder–MLP decoder pipeline to enhance global context modeling and mask generation. It highlights nnUNet as the current benchmark SOTA, discusses thin-thick slice domain adaptation, and proposes releasing a thin-slice multi-semantic NCCT dataset alongside self-supervised and diffusion-based pretraining strategies. The work aims to advance presurgical simulation and planning by improving segmentation accuracy and robustness, and it outlines future directions for handling long-tail data and domain-specific pretraining in medical imaging.
Abstract
This technical report analyzes non-contrast CT image segmentation in computer vision. It revisits a proposed method, examines the background of non-contrast CT imaging, and highlights the significance of segmentation. The study reviews representative methods, including convolutional-based and CNN-Transformer hybrid approaches, discussing their contributions, advantages, and limitations. The nnUNet stands out as the state-of-the-art method across various segmentation tasks. The report explores the relationship between the proposed method and existing approaches, emphasizing the role of global context modeling in semantic labeling and mask generation. Future directions include addressing the long-tail problem, utilizing pre-trained models for medical imaging, and exploring self-supervised or contrastive pre-training techniques. This report offers insights into non-contrast CT image segmentation and potential advancements in the field.
