Deep Learning-Based Segmentation of Tumors in PET/CT Volumes: Benchmark of Different Architectures and Training Strategies

Monika Górka; Daniel Jaworek; Marek Wodzinski

Deep Learning-Based Segmentation of Tumors in PET/CT Volumes: Benchmark of Different Architectures and Training Strategies

Monika Górka, Daniel Jaworek, Marek Wodzinski

TL;DR

This study benchmarks multiple deep learning architectures for segmenting cancer lesions in PET/CT across head/neck and whole‑body images, focusing on one‑step and two‑step segmentation strategies. Using AutoPET and HECKTOR datasets, nnU‑Net and U‑Net/V‑Net emerge as strong performers, while UNETR shows limited gains likely due to data size, underscoring the importance of data preparation and training strategy. A key finding is that training on cancer‑positive data and employing a two‑step approach can substantially improve segmentation metrics, highlighting the practical value of targeted data curation. Overall, the work demonstrates the potential of AI to support oncological diagnostics, while also pointing to the need for larger, more diverse datasets and pretraining to fully exploit transformer architectures.

Abstract

Cancer is one of the leading causes of death globally, and early diagnosis is crucial for patient survival. Deep learning algorithms have great potential for automatic cancer analysis. Artificial intelligence has achieved high performance in recognizing and segmenting single lesions. However, diagnosing multiple lesions remains a challenge. This study examines and compares various neural network architectures and training strategies for automatically segmentation of cancer lesions using PET/CT images from the head, neck, and whole body. The authors analyzed datasets from the AutoPET and HECKTOR challenges, exploring popular single-step segmentation architectures and presenting a two-step approach. The results indicate that the V-Net and nnU-Net models were the most effective for their respective datasets. The results for the HECKTOR dataset ranged from 0.75 to 0.76 for the aggregated Dice coefficient. Eliminating cancer-free cases from the AutoPET dataset was found to improve the performance of most models. In the case of AutoPET data, the average segmentation efficiency after training only on images containing cancer lesions increased from 0.55 to 0.66 for the classic Dice coefficient and from 0.65 to 0.73 for the aggregated Dice coefficient. The research demonstrates the potential of artificial intelligence in precise oncological diagnostics and may contribute to the development of more targeted and effective cancer assessment techniques.

Deep Learning-Based Segmentation of Tumors in PET/CT Volumes: Benchmark of Different Architectures and Training Strategies

TL;DR

Abstract

Paper Structure (17 sections, 3 equations, 4 figures, 5 tables)

This paper contains 17 sections, 3 equations, 4 figures, 5 tables.

Introduction
Overview
Related Work
Contribution
Materials and Methods
Overview
U-Net
UNETR
V-Net
nnU-Net
Datasets
Experimental Setup
Results
AutoPET
HECKTOR
...and 2 more sections

Figures (4)

Figure 1: Diagram illustrating a implementation of two-step segmentation process with three-channel aproach.
Figure 2: Visualization of the most accurate segmentation (DSC: 0.94) for various nnU-Net configurations trained on the tumor-only dataset. The ground truth tumor is marked in blue, while the model predictions are marked in red, orange, green, and yellow.
Figure 3: Exemplary visualizations of the obtained results from the HECKTOR dataset.
Figure 4: An example of a false negative prediction case for the U-Net one-step segmentation where the model failed to detect the presence of a tumor (marked in blue in the ground truth) near the bladder.

Deep Learning-Based Segmentation of Tumors in PET/CT Volumes: Benchmark of Different Architectures and Training Strategies

TL;DR

Abstract

Deep Learning-Based Segmentation of Tumors in PET/CT Volumes: Benchmark of Different Architectures and Training Strategies

Authors

TL;DR

Abstract

Table of Contents

Figures (4)